FUNCTIONAL ERROR ESTIMATORS FOR THE ADAPTIVE DISCRETIZATION OF INVERSE PROBLEMS

Abstract So-called functional error estimators provide a valuable tool for reliably estimating
the discretization error for a sum of two convex functions. We apply this concept
to Tikhonov regularization for the solution of inverse problems for partial differential
equations, not only for quadratic Hilbert space regularization terms but also for nonsmooth
Banach space penalties. Examples include the measure-space norm (i.e., sparsity regularization)
or the indicator function of an $L^\infty$ ball (i.e., Ivanov regularization). The error estimators
can be written in terms of residuals in the optimality system that can then be estimated by
conventional techniques, thus leading to explicit estimators. This is illustrated by means
of an elliptic inverse source problem with the above-mentioned penalties, and numerical
results are provided for the case of sparsity regularization.


1 INTRODUCTION 

Variational regularization often leads to minimizing a sum of two convex functionals and 
discretization is usually performed by restricting minimization to a finite dimensional subspace. 
For inverse problems in the context of large scale PDE models, adaptive refinement of the
computational mesh is crucial for an efficient numerical solution. Recent contributions to the 
topic of adaptive discretization of inverse problems can be found in, e.g., [24] on adaptive finite 
volume discretizations for Tikhonov-TV regularization, [32, 38] on moving mesh regularization 
and adaptive grid regularization, [9, 10, 14] on refinement and coarsening indicators, and [3, 6-8, 
23, 29-31] on goal oriented error estimators. 

A key step for adaptive discretization is reliable estimation of the discretization error using 
quantities available in the numerical computations, i.e., in an a posteriori fashion. The functional 
error estimators described in [42] allow for an exact estimate of the discretization error and 
appear to be particularly promising for Tikhonov regularized inverse problems since they have 
originally been developed in the context of minimization of a sum of two convex functionals. 
Yet so far they have not been considered for inverse problems and only very recently for control 
problems in, e.g., [22, 34, 48]. Regarding nonsmooth problems, functional error estimates have
been used to derive a posteriori error estimators for the finite-element discretization of total
variation denoising in [4].

In this work, we are concerned with linear inverse problems for PDEs consisting of the 
forward model 

(1.1) $Ay = Bu$

together with the measurement equation

(1.2) $Cy = g$,

where $u$ is the unknown parameter (e.g., source term, boundary data, or coefficient), $y$ is the
corresponding state solving (1.1), $g$ is the observable data, $A\colon \mathcal Y \to \mathcal W^*$, $B\colon \mathcal U \to \mathcal W^*$, and
$C\colon \mathcal Y \to \mathcal G$ are linear operators, and $\mathcal G$, $\mathcal U$, $\mathcal W$, and $\mathcal Y$ are Banach spaces.

As a simple motivating example, consider the inverse problem of electroencephalography 
[20], which consists in recovering the current density distribution within the brain from potential
measurements on the scalp. This can be formulated (assuming constant conductivity for
simplicity) as an inverse problem for the PDE

$-\Delta y = \chi_{\omega_c} u$ in $\Omega$,
$\partial_\nu y = f$ on $\partial\Omega$,

where $u$ is the desired current density, $\omega_c \subset \Omega$ denotes the region of interest inside the skull $\Omega$,
and $f$ is the given current flux on the scalp $\partial\Omega$. The measured data is $g = y|_\Gamma$, where $\Gamma \subset \partial\Omega$
denotes the location of the electrodes on the scalp. Here, $A$ is the negative Laplace operator, $B$ is
the extension operator from $\omega_c$ to $\Omega$, and $C$ is the Dirichlet trace operator on $\Gamma$.

In practice, only a noisy measurement $g^\delta$ will typically be available, where the noise level $\delta$
defined by

$\|g - g^\delta\|_{\mathcal G} \le \delta$

is here assumed to be known. Since the solution of such inverse problems is typically unstable,
regularization needs to be employed; see, e.g., [21, 45] and the references therein. We will here
consider the classical Tikhonov-Phillips method in Banach spaces with Morozov's discrepancy
principle as a regularization parameter choice strategy.

Using the parameter-to-state mapping

$S := A^{-1}B \in L(\mathcal U, \mathcal Y)$

and the reduced forward operator

$K := CS \in L(\mathcal U, \mathcal G)$,

we can write (1.1)-(1.2) as a single operator equation

$Ku = g$.

For this reduced formulation, Tikhonov's method is given by

(1.3) $\min_{u \in \mathcal U} J_\alpha(u, Ku)$ where $J_\alpha(u, g) = G(g) + \mathcal R_\alpha(u)$,

where $\mathcal R_\alpha$ is an appropriate regularizing functional and $G$ a discrepancy term, which in this
work will be assumed to have the form

(1.4) $G(g) = \frac12 \|g - g^\delta\|_{\mathcal G}^2$.

The discrepancy principle (or rather its relaxed version) amounts to choosing $\alpha = \alpha(\delta)$ such
that

$\underline\tau \delta \le \|K u_\alpha^\delta - g^\delta\|_{\mathcal G} \le \overline\tau \delta$

holds, where $u_\alpha^\delta$ is a minimizer of (1.3) and $\overline\tau > \underline\tau > 1$ are fixed constants independent of $\delta$.
Convergence of this method has been extensively investigated in the literature; see, e.g., [21] and
the references therein for an analysis in Hilbert spaces and [12, 45, 47] for a more general setting
similar to the one considered here. For actual numerical computations, the infinite-dimensional
problem has to be discretized: Finite-dimensional spaces $\mathcal U_h \subset \mathcal U$, $\mathcal Y_h \subset \mathcal Y$, and $\mathcal W_h \subset \mathcal W$ are
chosen, and the solution of $Ay = Bu$ is replaced by finding $y_h \in \mathcal Y_h$ such that

$\langle Ay_h - Bu, w_h \rangle = 0 \quad \forall w_h \in \mathcal W_h$.
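The relaxed discrepancy principle described above can be illustrated on a toy problem. The following is a minimal sketch, assuming a diagonal forward operator $K = \mathrm{diag}(s)$ on $\mathbb R^n$ and the quadratic penalty $\frac\alpha2\|u\|^2$, for which the Tikhonov minimizer is available in closed form; the function names, the bracketing interval, and the test data are hypothetical and not taken from this work.

```python
import math

def tikhonov_diag(s, g, alpha):
    """Minimizer of 0.5*|Ku - g|^2 + 0.5*alpha*|u|^2 for the diagonal operator K = diag(s)."""
    return [si * gi / (si * si + alpha) for si, gi in zip(s, g)]

def residual_norm(s, g, alpha):
    """Discrepancy |K u_alpha - g|; for diagonal K it increases monotonically with alpha."""
    u = tikhonov_diag(s, g, alpha)
    return math.sqrt(sum((si * ui - gi) ** 2 for si, ui, gi in zip(s, u, g)))

def discrepancy_alpha(s, g_delta, delta, tau_lo=1.5, tau_hi=3.0, max_iter=200):
    """Find alpha with tau_lo*delta <= |K u_alpha - g_delta| <= tau_hi*delta."""
    lo, hi = 1e-16, 1.0
    # Enlarge the bracket until the residual at `hi` is large enough.
    for _ in range(60):
        if residual_norm(s, g_delta, hi) >= tau_lo * delta:
            break
        hi *= 10.0
    else:
        raise ValueError("data too small relative to tau_lo*delta")
    for _ in range(max_iter):
        alpha = math.sqrt(lo * hi)  # bisection on a logarithmic scale
        r = residual_norm(s, g_delta, alpha)
        if r < tau_lo * delta:
            lo = alpha
        elif r > tau_hi * delta:
            hi = alpha
        else:
            return alpha
    raise RuntimeError("no admissible alpha found")

# Hypothetical test data: exact solution u = (1, 1, 1, 1), noise of Euclidean norm delta.
s = [1.0, 0.5, 0.1, 0.01]
delta = 0.01
noise = [0.5 * delta * sgn for sgn in (1, -1, 1, -1)]
g_delta = [si * 1.0 + ni for si, ni in zip(s, noise)]
alpha = discrepancy_alpha(s, g_delta, delta)
```

Since the residual is continuous and monotone in $\alpha$ in this setting, any bracketing method works; the logarithmic bisection is merely one convenient choice.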

To carry the convergence results over from the infinite-dimensional to the discretized problem, 
the error due to discretization has to be assessed. In particular, it is important to carefully balance 
discretization and regularization. As it turns out, only errors in the functionals G and Ja need to be 
controlled in order to obtain a convergent adaptive method. This makes the theory of functional 
error estimators in [42] applicable. As we will show, these estimators are applicable for different 
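choices of regularization functionals. These include the usual squared Hilbert-space norm, i.e.,
$\mathcal R_\alpha = \frac\alpha2 \|\cdot\|_{\mathcal U}^2$, but also nonsmooth penalties of the form $\mathcal R_\alpha = \delta_{B^{L^\infty(\omega_c)}_{1/\alpha}}$ or $\mathcal R_\alpha = \alpha \|\cdot\|_{\mathcal M(\omega_c)}$,
where $\mathcal M(\Omega)$ is the space of Radon measures. The latter penalty is useful for incorporating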
sparsity regularization, while the former penalty corresponds to Ivanov regularization (also 
called method of quasi-solutions, see [26-28, 36, 47], as well as [39] in the context of Hilbert 
scales), where the regularization does not take the usual additive form with a as a multiplier. 
In all these cases, the functional error estimators can be computed in terms of residuals in the 
optimality system. 

This work is organized as follows. After fixing some common notation, we present in Section 2 
the basic results on convergence of adaptively discretized regularization methods and the 
functional error estimates our analysis relies on. These estimators are then applied to the 
classical Hilbert space regularization in Section 3, first in the general setting and then specifically 
for a model inverse source problem for the Poisson equation. Similarly, Section 4 and Section 5 
treat the case of Banach space norm constraints and norm regularization, respectively, again 
both in the general setting and for model problems (Ivanov regularization resp. sparsity). For 
the latter, numerical experiments given in Section 6 demonstrate the efficiency of our approach. 


2 NOTATION AND PRELIMINARY RESULTS 

For some Banach space $X$ with dual $X^*$, we use the notation $\langle x^*, x \rangle_{X^*, X}$ for the canonical duality
pairing. In case of a Hilbert space $X$, $(x_1, x_2)_X$ denotes the inner product. Moreover, $\delta_S$ denotes
the indicator function of some set $S$ and $B_r^X$ the closed ball of radius $r$ around zero in the normed
space $X$.

2.1 FUNCTIONAL-ANALYTIC SETTING


In the following, we assume that $\mathcal U$, $\mathcal W$, $\mathcal X$, $\mathcal Y$ are Banach spaces with $\mathcal W$ and $\mathcal Y$ being reflexive,
and that $\mathcal G$ is a Hilbert space. Furthermore, we suppose that either $\mathcal X = \mathcal U^*$ or $\mathcal U = \mathcal X^*$ holds,
which allows us to use a consistent notation in the rest of the paper and to avoid cumbersome
case distinctions. For a convex functional $F\colon \mathcal U \to \overline{\mathbb R}$, we will denote by

$F^*\colon \mathcal X \to \overline{\mathbb R}$, $F^*(x) = \sup_{u \in \mathcal U} \langle u, x \rangle_{\mathcal U, \mathcal X} - F(u)$

its Fenchel conjugate. If $\mathcal X = \mathcal U^*$, this coincides with the usual definitions in the sense of convex
analysis. For $\mathcal U = \mathcal X^*$, it is common to define as here the Fenchel conjugate on $\mathcal X$ instead of $\mathcal X^{**}$
in the special case of $F = G^*$ (i.e., the biconjugate of $G$); the redefinition in the general case is
less common but still consistent and coincides with the "predual" approach as in, e.g., [16]. This
will allow working with spaces of continuous functions instead of the dual of measure spaces
later on. In particular, the Fenchel conjugate of $F(u) = \alpha \|u\|_{\mathcal U}$ is always given by

$F^*(x) = \delta_{B_\alpha^{\mathcal X}}(x) = 0$ if $\|x\|_{\mathcal X} \le \alpha$, and $F^*(x) = \infty$ if $\|x\|_{\mathcal X} > \alpha$.

In the case that $\mathcal U$ is a Hilbert space, we set $\mathcal X = \mathcal U$, in which case the duality pairing coincides
with the standard inner product. In particular, for $F(u) = \frac12 \|u - z\|_{\mathcal U}^2$ we have

$F^*(x) = \frac12 \|x + z\|_{\mathcal U}^2 - \frac12 \|z\|_{\mathcal U}^2$.
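The two conjugates used in this work can be checked numerically in finite dimensions. The following is a small sketch, assuming $\mathcal U = \mathbb R^n$ with the $\ell^1$ norm (so that the dual norm is the $\ell^\infty$ norm) for the norm penalty and the Euclidean norm for the quadratic term; all function names are hypothetical.

```python
import math
import random

def conj_scaled_l1(x, alpha):
    """Conjugate of F(u) = alpha*|u|_1: indicator of the l-infinity ball of radius alpha."""
    return 0.0 if max(abs(xi) for xi in x) <= alpha else math.inf

def conj_shifted_quadratic(x, z):
    """Conjugate of F(u) = 0.5*|u - z|^2: F*(x) = 0.5*|x + z|^2 - 0.5*|z|^2."""
    return 0.5 * sum((xi + zi) ** 2 for xi, zi in zip(x, z)) - 0.5 * sum(zi * zi for zi in z)

def young_gap_quadratic(u, x, z):
    """F(u) + F*(x) - <u, x> for F(u) = 0.5*|u - z|^2; nonnegative by the Fenchel-Young inequality."""
    f = 0.5 * sum((ui - zi) ** 2 for ui, zi in zip(u, z))
    pairing = sum(ui * xi for ui, xi in zip(u, x))
    return f + conj_shifted_quadratic(x, z) - pairing

def l1_objective(u, x, alpha):
    """<u, x> - alpha*|u|_1, whose supremum over u defines the conjugate of alpha*|.|_1."""
    return sum(ui * xi for ui, xi in zip(u, x)) - alpha * sum(abs(ui) for ui in u)

random.seed(0)
z = [0.3, -1.2, 2.0]
x = [1.0, 0.5, -0.7]
u_star = [xi + zi for xi, zi in zip(x, z)]  # maximizer of <u, x> - F(u) in the quadratic case
```

The Fenchel-Young gap vanishes exactly at `u_star`, while for the norm penalty the objective stays nonpositive for $x$ inside the ball and grows without bound along a suitable direction for $x$ outside of it.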




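We further denote by

$\partial F(u) := \{ x \in \mathcal U^* : \langle \tilde u - u, x \rangle_{\mathcal U, \mathcal U^*} \le F(\tilde u) - F(u)$ for all $\tilde u \in \mathcal U \}$

the convex subdifferential of $F\colon \mathcal U \to \overline{\mathbb R}$. Note that we always have the inclusion $\mathcal X \subset \mathcal U^*$,
either by equality or by using the canonical injection from $\mathcal X$ to $\mathcal X^{**}$. In the latter case, existence
of the duality mapping $j^{\mathcal U}\colon \mathcal U \to \mathcal X$, defined by

$\|j^{\mathcal U}(u)\|_{\mathcal X} = 1$ and $\langle u, j^{\mathcal U}(u) \rangle_{\mathcal U, \mathcal X} = \|u\|_{\mathcal U}$ for all $u \in \mathcal U$,

i.e., $j^{\mathcal U}(u) \in \partial(\|\cdot\|_{\mathcal U})(u)$, becomes an additional assumption.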

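We further need the linear operators $A \in L(\mathcal Y, \mathcal W^*)$, $B \in L(\mathcal U, \mathcal W^*)$, and $C \in L(\mathcal Y, \mathcal G)$ and
assume that $A$ is continuously invertible. We will also make use of the adjoints

$A^* \in L(\mathcal W, \mathcal Y^*)$ with $\langle Ay, w \rangle_{\mathcal W^*, \mathcal W} = \langle y, A^* w \rangle_{\mathcal Y, \mathcal Y^*}$ for all $y \in \mathcal Y$, $w \in \mathcal W$,
$B^* \in L(\mathcal W, \mathcal X)$ with $\langle Bu, w \rangle_{\mathcal W^*, \mathcal W} = \langle u, B^* w \rangle_{\mathcal U, \mathcal X}$ for all $u \in \mathcal U$, $w \in \mathcal W$,
$C^* \in L(\mathcal G, \mathcal Y^*)$ with $(Cy, g)_{\mathcal G} = \langle y, C^* g \rangle_{\mathcal Y, \mathcal Y^*}$ for all $y \in \mathcal Y$, $g \in \mathcal G$,
$K^* \in L(\mathcal G, \mathcal X)$ with $(Ku, g)_{\mathcal G} = \langle u, K^* g \rangle_{\mathcal U, \mathcal X}$ for all $u \in \mathcal U$, $g \in \mathcal G$,
$S^* \in L(\mathcal Y^*, \mathcal X)$ with $\langle Su, y^* \rangle_{\mathcal Y, \mathcal Y^*} = \langle u, S^* y^* \rangle_{\mathcal U, \mathcal X}$ for all $u \in \mathcal U$, $y^* \in \mathcal Y^*$.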


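Let us emphasize that the existence of $B^*$ with the mentioned properties is an actual assumption
in the case $\mathcal U = \mathcal X^*$, which is equivalent to the assumption that $B$ is the adjoint operator of an
operator *B. (With a slight abuse of notation in the first two cases, since these are actually the
compositions of the standard adjoints with the canonical embeddings $\mathcal W \to \mathcal W^{**}$.) In addition,
$\{\mathcal R_\alpha\}_{\alpha > 0}$, $\mathcal R_\alpha\colon \mathcal U \to \overline{\mathbb R}$, is a family of proper, convex, lower semicontinuous functionals.

Finally, let $\mathcal U_h$, $\mathcal Y_h$, $\mathcal W_h$ be finite dimensional subspaces of $\mathcal U$, $\mathcal Y$, $\mathcal W$, respectively. In the case
that $\mathcal U$ is a Hilbert space, we will denote by $P_{\mathcal U_h}$ the orthogonal projection onto $\mathcal U_h$. Furthermore,
$R_{\mathcal W_h}\colon \mathcal W^* \to \mathcal W_h^*$ and $R_{\mathcal Y_h}\colon \mathcal Y^* \to \mathcal Y_h^*$ are the Ritz projectors defined by

(2.1) $\langle R_{\mathcal W_h} w^*, w_h \rangle_{\mathcal W_h^*, \mathcal W_h} = \langle w^*, w_h \rangle_{\mathcal W^*, \mathcal W}$ for all $w_h \in \mathcal W_h$, and analogously for $R_{\mathcal Y_h}$.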

2.2 CONVERGENCE OF ADAPTIVELY DISCRETIZED TIKHONOV REGULARIZATION 

We consider the Tikhonov regularization (1.3) equivalently written as a PDE-constrained
minimization problem

(2.2) $\min_{u \in \mathcal U,\, y \in \mathcal Y} J_\alpha(u, y) := \frac12 \|Cy - g^\delta\|_{\mathcal G}^2 + \mathcal R_\alpha(u)$ s.t. $Ay = Bu$ in $\mathcal W^*$.

The discrete counterpart of (2.2) reads

(2.3) $\min_{u \in \mathcal U_h,\, y \in \mathcal Y_h} J_\alpha(u, y)$ s.t. $R_{\mathcal W_h}(Ay - Bu) = 0$.

Let $(u_\alpha^\delta, y_\alpha^\delta)$ be the exact Tikhonov minimizer, i.e., a solution of (2.2), and let $(u_h, y_h) \in \mathcal U_h \times \mathcal Y_h$
be some approximation, e.g., a solution of the discrete problem (2.3). In this abstract setting we
just presume existence of minimizers of (2.2) and (2.3) and will verify this assumption for the
applications in Section 3.2, Section 4.2, and Section 5.2. The question is now how the convergence
of the discrete approximation to solutions of the equation $Ku = g$ can be guaranteed for
$(h, \alpha, \delta) \searrow 0$.

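The following proposition shows (similarly as in [30, 37]) that it is enough to adapt the discretization
and the choice of the regularization parameter $\alpha(\delta, h)$ in such a way that the difference in
the functional values satisfies

$J_\alpha(u_h, y_h) - J_\alpha(u_\alpha^\delta, y_\alpha^\delta) \le \eta_J$

and the difference in the discrepancy values satisfies

(2.4) $\bigl| \|K_h u_h - g^\delta\|_{\mathcal G} - \|K u_\alpha^\delta - g^\delta\|_{\mathcal G} \bigr| = \bigl| \|C y_h - g^\delta\|_{\mathcal G} - \|C y_\alpha^\delta - g^\delta\|_{\mathcal G} \bigr| \le \eta_D$,

where $\eta_J$ and $\eta_D$ can be controlled to be small enough relative to $\delta$.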

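Proposition 2.1. Let $(u_\alpha^\delta, y_\alpha^\delta)$ be a minimizer of (2.2) and $(u_{\alpha,h}^\delta, y_{\alpha,h}^\delta)$ a minimizer of (2.3).
Let $\alpha(\delta)$ be chosen such that for some constants $c_1, c_2, \underline\tau, \overline\tau > 0$ independent of $\delta$ with
$\overline\tau > \underline\tau > \max\{\sqrt{1 + 2c_2}, 1 + c_1\}$, the estimates

(2.5) $\underline\tau \delta \le \|C y_{\alpha(\delta),h}^\delta - g^\delta\|_{\mathcal G} \le \overline\tau \delta$,

(2.6) $\bigl| \|C y_{\alpha(\delta),h}^\delta - g^\delta\|_{\mathcal G} - \|C y_{\alpha(\delta)}^\delta - g^\delta\|_{\mathcal G} \bigr| \le c_1 \delta$,

and

(2.7) $J_{\alpha(\delta)}(u_{\alpha(\delta),h}^\delta, y_{\alpha(\delta),h}^\delta) - J_{\alpha(\delta)}(u_{\alpha(\delta)}^\delta, y_{\alpha(\delta)}^\delta) \le c_2 \delta^2$

hold. Then for any solution $u^\dagger$ to $Ku = g$, we have

(2.8) $\mathcal R_{\alpha(\delta)}(u_{\alpha(\delta)}^\delta) \le \mathcal R_{\alpha(\delta)}(u^\dagger)$ and $\mathcal R_{\alpha(\delta)}(u_{\alpha(\delta),h}^\delta) \le \mathcal R_{\alpha(\delta)}(u^\dagger)$ for all $\delta > 0$.

Moreover, we have

(2.9) $\|C y_{\alpha(\delta),h}^\delta - g^\delta\|_{\mathcal G} \le \overline\tau \delta \to 0$ and $\|K u_{\alpha(\delta)}^\delta - g^\delta\|_{\mathcal G} \le (\overline\tau + c_1) \delta \to 0$.

Proof. Set $\alpha_* = \alpha(\delta)$. By the assumptions (2.5)-(2.7) and minimality of $(u_{\alpha_*}^\delta, y_{\alpha_*}^\delta)$, we have for any
solution $u^\dagger$ to $Ku = g$

$\frac{\underline\tau^2}{2} \delta^2 + \mathcal R_{\alpha_*}(u_{\alpha_*,h}^\delta) - c_2 \delta^2 \le \frac12 \|C y_{\alpha_*,h}^\delta - g^\delta\|_{\mathcal G}^2 + \mathcal R_{\alpha_*}(u_{\alpha_*,h}^\delta) - c_2 \delta^2$
$\le \frac12 \|C y_{\alpha_*}^\delta - g^\delta\|_{\mathcal G}^2 + \mathcal R_{\alpha_*}(u_{\alpha_*}^\delta) \le \frac12 \|K u^\dagger - g^\delta\|_{\mathcal G}^2 + \mathcal R_{\alpha_*}(u^\dagger)$
$\le \frac12 \delta^2 + \mathcal R_{\alpha_*}(u^\dagger) \le \frac{1}{2(\underline\tau - c_1)^2} \|C y_{\alpha_*}^\delta - g^\delta\|_{\mathcal G}^2 + \mathcal R_{\alpha_*}(u^\dagger)$

(where we have used $\|C y_{\alpha_*}^\delta - g^\delta\|_{\mathcal G} \ge (\underline\tau - c_1) \delta$ in the last estimate), which by comparison of
the third and the sixth as well as of the first and the fifth expression in this chain of inequalities
together with $\underline\tau > \max\{\sqrt{1 + 2c_2}, 1 + c_1\}$ yields (2.8). The convergence (2.9) follows directly from
(2.5) and (2.6). □

Note that no absolute value is required in the estimate (2.7). From (2.8) and (2.9), convergence
and convergence rates for both the continuous and discrete sequence as $\delta \to 0$ follow under
the usual assumptions on $\mathcal R$, see, e.g., [21, 45, 46].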

Remark 2.2. Here we have taken into account the fact that in practical computations, the discrepancy
principle (2.5) can only be checked for the discrete residual $\|C y_{\alpha(\delta),h}^\delta - g^\delta\|_{\mathcal G} = \|K_h u_{\alpha(\delta),h}^\delta - g^\delta\|_{\mathcal G}$,
not the exact residual $\|K u_{\alpha(\delta),h}^\delta - g^\delta\|_{\mathcal G}$ for which (2.4) can be employed. To bridge the gap
between these two quantities, we will use the triangle inequality and an additional estimate of
$\|K_h u_{\alpha(\delta),h}^\delta - K u_{\alpha(\delta),h}^\delta\|_{\mathcal G}$.

The accuracy requirements that will have to be met by an adaptive discretization are stated in 
assumptions (2.6) and (2.7). Note that for this purpose, the accuracy of u need not be controlled 
directly, but only via the residual norm and cost function values. In the next section, we will 
derive corresponding estimates based on the functional error estimates from [42]. 

2.3 FUNCTIONAL A POSTERIORI ESTIMATORS 

Our approach is based on the following functional error estimate, which is inspired by [42]. 
We employ the strong convexity of the discrepancy term (1.4) to obtain a slightly improved 
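estimate.

Proposition 2.3. Let $(u_\alpha^\delta, y_\alpha^\delta)$ be a minimizer of (2.2). Assume that there is a family of functions
$\{\varphi_\alpha\}_{\alpha > 0}$, $\varphi_\alpha\colon \mathcal U \times \mathcal U \to \mathbb R_+$, satisfying

(2.10) $\lambda (1 - \lambda) \varphi_\alpha(u_1, u_2) \le \lambda \mathcal R_\alpha(u_1) + (1 - \lambda) \mathcal R_\alpha(u_2) - \mathcal R_\alpha(\lambda u_1 + (1 - \lambda) u_2)$

for all $u_1, u_2 \in \mathcal U$, $\alpha > 0$, and $\lambda \in (0, 1)$. Then, any $v \in \mathcal U$ and $g^* \in \mathcal G$ satisfy

(2.11) $\frac12 \|K(u_\alpha^\delta - v)\|_{\mathcal G}^2 + \varphi_\alpha(u_\alpha^\delta, v) \le J_\alpha(v, Kv) - J_\alpha(u_\alpha^\delta, K u_\alpha^\delta)$
$\le \mathcal R_\alpha(v) + \mathcal R_\alpha^*(K^* g^*) + G(Kv) + G^*(-g^*)$.

Proof. Due to the assumptions and the strong convexity of $G$, we have for $v \in \mathcal U$ and $\lambda \in (0, 1)$

$\lambda (1 - \lambda) \bigl( \frac12 \|K(u_\alpha^\delta - v)\|_{\mathcal G}^2 + \varphi_\alpha(u_\alpha^\delta, v) \bigr) \le \lambda J_\alpha(u_\alpha^\delta, K u_\alpha^\delta) + (1 - \lambda) J_\alpha(v, Kv)$
$- J_\alpha\bigl( \lambda u_\alpha^\delta + (1 - \lambda) v, K(\lambda u_\alpha^\delta + (1 - \lambda) v) \bigr)$
$\le (1 - \lambda) \bigl( J_\alpha(v, Kv) - J_\alpha(u_\alpha^\delta, K u_\alpha^\delta) \bigr)$,

where we have used optimality of $u_\alpha^\delta$ in the last step. Dividing by $1 - \lambda$ and letting $\lambda \nearrow 1$, we
obtain the first inequality. The second inequality is a consequence of weak duality. □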

Condition (2.10) is satisfied, e.g., with $\varphi_\alpha(u_1, u_2) = \frac\alpha2 \|u_1 - u_2\|_{\mathcal U}^2$ in the case of a quadratic
Hilbert space penalty; see Section 3. But we will see that (2.11) still provides valuable information
on the error if (2.10) is only satisfied with $\varphi_\alpha(u_1, u_2) = 0$, as in the case of Banach space norm
constraints and penalties; see Section 4 and Section 5, respectively.

Here it is important to note that the right-hand side of estimate (2.11) does not contain the
unknown solution $u_\alpha^\delta$. We will use this estimate with $v := u_h$, which is available in the
numerical computations. We also point out that the right-hand side corresponds to the duality
gap between problem (1.3) and its dual problem in the sense of convex analysis; see, e.g., [19].
Hence if $v$ and $g^*$ satisfy primal-dual extremality relations for (1.3), then the right-hand side of
(2.11) vanishes.
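The two-sided estimate (2.11) can be checked numerically in a simple setting. The following is a minimal sketch, assuming a diagonal operator $K = \mathrm{diag}(s)$ on $\mathbb R^n$, the quadratic penalty $\mathcal R_\alpha = \frac\alpha2 \|\cdot\|^2$ with $\varphi_\alpha(u_1, u_2) = \frac\alpha2 \|u_1 - u_2\|^2$, and the dual candidate $g^* = g^\delta - Kv$; the names and test values are hypothetical.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cost(u, s, g_delta, alpha):
    """J_alpha(u, Ku) = 0.5*|Ku - g_delta|^2 + 0.5*alpha*|u|^2 for K = diag(s)."""
    r = [si * ui - gi for si, ui, gi in zip(s, u, g_delta)]
    return 0.5 * dot(r, r) + 0.5 * alpha * dot(u, u)

def duality_gap(v, g_star, s, g_delta, alpha):
    """Right-hand side of (2.11): R(v) + R*(K* g*) + G(Kv) + G*(-g*)."""
    kt_gstar = [si * gi for si, gi in zip(s, g_star)]      # K* g* for the symmetric diagonal K
    r_primal = 0.5 * alpha * dot(v, v)                      # R_alpha(v)
    r_conj = dot(kt_gstar, kt_gstar) / (2.0 * alpha)        # conjugate of the quadratic penalty
    res = [si * vi - gi for si, vi, gi in zip(s, v, g_delta)]
    g_primal = 0.5 * dot(res, res)                          # G(Kv)
    diff = [a - b for a, b in zip(g_star, g_delta)]
    g_conj = 0.5 * dot(diff, diff) - 0.5 * dot(g_delta, g_delta)  # G*(-g*)
    return r_primal + r_conj + g_primal + g_conj

s = [1.0, 0.5, 0.1]
g_delta = [1.0, 0.4, 0.2]
alpha = 0.1
u = [si * gi / (si * si + alpha) for si, gi in zip(s, g_delta)]  # exact minimizer
v = [0.8, 0.9, 0.5]                                              # arbitrary approximation
g_star = [gi - si * vi for si, vi, gi in zip(s, v, g_delta)]
e = [ui - vi for ui, vi in zip(u, v)]
ke = [si * ei for si, ei in zip(s, e)]
lower = 0.5 * dot(ke, ke) + 0.5 * alpha * dot(e, e)
middle = cost(v, s, g_delta, alpha) - cost(u, s, g_delta, alpha)
upper = duality_gap(v, g_star, s, g_delta, alpha)
```

In this purely quadratic case the lower bound coincides with the difference of the functional values up to rounding, and the gap vanishes when `v` is the exact minimizer and `g_star` the corresponding residual.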

The sub- and superscripts $\alpha$, $\delta$ will be omitted in the following. Instead, we will write $(u, y)$,
$(u_h, y_h)$ for the continuous and discrete minimizers $(u_\alpha^\delta, y_\alpha^\delta)$, $(u_{\alpha,h}^\delta, y_{\alpha,h}^\delta)$, respectively.

2.4 MODEL PROBLEM 

To illustrate the derived estimates, we will apply them to the identification of the source term $u$
in

(2.12) $-\Delta y = \chi_{\omega_c} u$ in $\Omega$, $y = 0$ on $\partial\Omega$,

on a domain $\Omega \subset \mathbb R^n$, $n \in \{1, 2, 3\}$, from restricted observations $g^\delta$ of $y$ in $\omega_o$. Hence,

$Ay = -\Delta y$, $A^* = A$,
$Bu = \chi_{\omega_c} u$, $B^* w = w|_{\omega_c}$,
$Cy = y|_{\omega_o}$, $C^* g = \chi_{\omega_o} g$,

and $\mathcal G = L^2(\omega_o)$. In the sequel, we assume that $\Omega$ is polyhedral and convex. This enables
us to employ $H^2$-regularity results for the elliptic equation (2.12). In addition, we can avoid
technicalities in the finite element setting on curved domains.

We define $\mathcal Y_h = \mathcal W_h$ by continuous piecewise linear finite elements on a shape regular
triangulation $\mathcal T_h$ consisting of element domains $K$; see, e.g., [11]. The set of all faces of elements
will be denoted by $\mathcal E_h$. The associated nodal interpolation operator will be denoted by $I_h$, which
is continuous from $C_0(\Omega)$ to $\mathcal Y_h$. We will employ the standard interpolation estimates

(2.13) $\|v - I_h v\|_{L^2(K)} \le c_I h_K^2 |v|_{H^2(K)}$ for all $K \in \mathcal T_h$, $v \in H^2(\Omega)$,
$\|v - I_h v\|_{L^2(F)} \le c_I h_K^{3/2} |v|_{H^2(K)}$ for all $F \in \mathcal E_h$ with $F \subset \partial K$, $v \in H^2(\Omega)$,

where $h_K$ is the element diameter, as well as the stability estimate

(2.14) $\|v\|_{H^2(\Omega)} \le c_S \|\Delta v\|_{L^2(\Omega)}$ for all $v \in H^2(\Omega) \cap H_0^1(\Omega)$,

cf. [11, Thm. 11.6.4], [15, Thm. 3.3.7].


3 HILBERT SPACE REGULARIZATION 

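In this section, we assume that $\mathcal U$ is a Hilbert space, identify $\mathcal X$ with $\mathcal U$, and consider as
regularization term the squared norm, i.e.,

$\mathcal R_\alpha = \frac\alpha2 \|\cdot\|_{\mathcal U}^2$, and hence $\mathcal R_\alpha^* = \frac{1}{2\alpha} \|\cdot\|_{\mathcal U}^2$.

Since $\mathcal R_\alpha$ is differentiable, we obtain for (2.2) by standard Lagrangian calculus the optimality
system

(3.1) $C^*(Cy - g^\delta) + A^* w = 0$,
$\alpha u - B^* w = 0$,
$Ay - Bu = 0$.

The corresponding discrete system for (2.3) is

(3.2) $R_{\mathcal Y_h}[C^*(Cy_h - g^\delta) + A^* w_h] = 0$,
$\alpha u_h - P_{\mathcal U_h} B^* w_h = 0$,
$R_{\mathcal W_h}(Ay_h - Bu_h) = 0$,

with $R_{\mathcal Y_h}$, $R_{\mathcal W_h}$ as in (2.1), which corresponds to a finite element discretization of the state and
adjoint equation. The solution $(u_h, y_h, w_h) \in \mathcal U_h \times \mathcal Y_h \times \mathcal W_h$ of (3.2) can be considered as an
approximation to the solution $(u, y, w) \in \mathcal U \times \mathcal Y \times \mathcal W$ of (3.1).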




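3.1 ERROR ESTIMATES

Setting $\varphi_\alpha(u_1, u_2) = \frac\alpha2 \|u_1 - u_2\|_{\mathcal U}^2$, we obtain from Proposition 2.3 that the solution $u$ to (1.3)
satisfies

(3.3) $\alpha \|\tilde u - u\|_{\mathcal U}^2 + \|K\tilde u - Ku\|_{\mathcal G}^2 \le 2 \bigl( J_\alpha(\tilde u, K\tilde u) - J_\alpha(u, Ku) \bigr)$
$\le \alpha \|\tilde u\|_{\mathcal U}^2 + \frac1\alpha \|K^* g^*\|_{\mathcal U}^2 + \|K\tilde u - g^\delta\|_{\mathcal G}^2 + \|g^* - g^\delta\|_{\mathcal G}^2 - \|g^\delta\|_{\mathcal G}^2$
$= \alpha \|\tilde u\|_{\mathcal U}^2 + \frac1\alpha \|K^*(\tilde g - g^\delta)\|_{\mathcal U}^2 + \|K\tilde u - g^\delta\|_{\mathcal G}^2 + \|\tilde g\|_{\mathcal G}^2 - \|g^\delta\|_{\mathcal G}^2$

for any $\tilde u \in \mathcal U$ and $g^* := g^\delta - \tilde g \in \mathcal G$ for any $\tilde g \in \mathcal G$. We now define

(3.4) $\tilde y := S u_h = A^{-1} B u_h$.

Inserting $\tilde u = u_h$ and $\tilde g = C y_h$ in (3.3), we arrive at

(3.5) $\alpha \|u_h - u\|_{\mathcal U}^2 + \|C\tilde y - Cy\|_{\mathcal G}^2 \le 2 \bigl( J_\alpha(u_h, \tilde y) - J_\alpha(u, y) \bigr)$
$\le \frac1\alpha \|\alpha u_h + S^* C^*(C y_h - g^\delta)\|_{\mathcal U}^2 + \|C(A^{-1} B u_h - y_h)\|_{\mathcal G}^2$
$= \frac1\alpha \|\alpha u_h - B^* w_h + S^* [C^*(C y_h - g^\delta) + A^* w_h]\|_{\mathcal U}^2 + \|C A^{-1}(A y_h - B u_h)\|_{\mathcal G}^2$.

Here, (3.5) contains the residuals of the equations in the optimality system (3.1), which are given
by

(3.6) $\rho_w := C^*(C y_h - g^\delta) + A^* w_h = A^*(w_h - \tilde w)$,
$\rho_u := \alpha u_h - B^* w_h$,
$\rho_y := A y_h - B u_h = A(y_h - \tilde y)$,

where $(\tilde y, \tilde w) \in \mathcal Y \times \mathcal W$ and $(y_h, w_h) \in \mathcal Y_h \times \mathcal W_h$ satisfy

(3.7) $R_{\mathcal Y_h}[C^*(C y_h - g^\delta) + A^* w_h] = 0$, $R_{\mathcal W_h}(A y_h - B u_h) = 0$,
and $C^*(C y_h - g^\delta) + A^* \tilde w = 0$, $A \tilde y - B u_h = 0$,

for the same $u_h \in \mathcal U_h$ (note that the first, discrete system is coupled, as opposed to the second one). Thus
the inequality (3.5) appears to be suited for a posteriori error estimation.

Although estimate (3.5) only gives an estimate on $K u_h - K u = C\tilde y - Cy$ and not on $K_h u_h - K u =
C y_h - Cy$ (which is needed for (2.6)), we can use the identity $y_h - \tilde y = A^{-1} \rho_y$, i.e.,

(3.8) $C y_h - C\tilde y = C A^{-1}(A y_h - B u_h)$,

the triangle inequality, and the fact that

(3.9) $\forall a, b, c, d \ge 0$: $a + b^2 \le c + d^2 \;\Rightarrow\; a + (b + d)^2 \le \gamma c + \sigma d^2$

holds for

(3.10) ($\sigma = 4$ and $\gamma \ge 2$) or ($\sigma > 4$ and $\gamma \ge 2\sigma \ldots$)

(see the appendix for a proof) as well as

(3.11) $J_\alpha(u_h, y_h) - J_\alpha(u_h, \tilde y) = (C y_h - g^\delta, C y_h - C\tilde y)_{\mathcal G} - \frac12 \|C y_h - C\tilde y\|_{\mathcal G}^2$

to obtain from (3.5) the following a posteriori estimate.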

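Proposition 3.1. Let $\mathcal U$ be a Hilbert space and $\mathcal R_\alpha = \frac\alpha2 \|\cdot\|_{\mathcal U}^2$. Then the minimizers $(u, y)$ of (2.2)
and $(u_h, y_h)$ of (2.3) satisfy the estimates

(3.12) $\alpha \|u_h - u\|_{\mathcal U}^2 + \|C y_h - C y\|_{\mathcal G}^2 \le \frac\gamma\alpha \|B^*(A^*)^{-1} \rho_w + \rho_u\|_{\mathcal U}^2 + \sigma \|C A^{-1} \rho_y\|_{\mathcal G}^2$,

(3.13) $J_\alpha(u_h, y_h) - J_\alpha(u, y) \le \frac{1}{2\alpha} \|B^*(A^*)^{-1} \rho_w + \rho_u\|_{\mathcal U}^2 + (C y_h - g^\delta, C A^{-1} \rho_y)_{\mathcal G}$,

with $\sigma$ and $\gamma$ as in (3.10) and $\rho_w$, $\rho_u$, and $\rho_y$ as in (3.6).

Here the factors $\sigma$ and $\gamma$ may be used to minimize the right-hand side of the estimate. In the
following, we will fix $\sigma = 4$, $\gamma = 2$ for simplicity.

At first glance, estimate (3.5) requires the solution of state and adjoint equation on a fine grid for
applying $S^*$ and $C A^{-1}$, but this can be avoided in some relevant examples; see, e.g., Section 3.2
below.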
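The elementary implication (3.9) with the fixed choice $\sigma = 4$, $\gamma = 2$ follows from $(b + d)^2 \le 2b^2 + 2d^2$ and can be spot-checked numerically. The following is a small sketch; the function name, the sampling range, and the tolerance are hypothetical choices.

```python
import random

def implication_holds(a, b, c, d, sigma=4.0, gamma=2.0, tol=1e-9):
    """Check: a + b^2 <= c + d^2 implies a + (b + d)^2 <= gamma*c + sigma*d^2."""
    if a + b * b > c + d * d:
        return True  # premise violated, nothing to check
    return a + (b + d) ** 2 <= gamma * c + sigma * d * d + tol

random.seed(1)
samples = [[random.uniform(0.0, 5.0) for _ in range(4)] for _ in range(10000)]
all_ok = all(implication_holds(*t) for t in samples)

# With a = c = 0 and b = d = 1 the conclusion holds with equality for sigma = 4,
# so sigma cannot be lowered in general.
sharp = not implication_holds(0.0, 1.0, 0.0, 1.0, sigma=3.9)
```

The second check illustrates why $\sigma = 4$ is the smallest admissible value when no contribution from $c$ is available.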


3.2 APPLICATION TO INVERSE SOURCE PROBLEM 

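We now apply the estimate from Proposition 3.1 to the model problem (2.12). In this case, we
have $\mathcal U = L^2(\omega_c)$ as well as $\mathcal Y = H_0^1(\Omega) = \mathcal W$, and the Tikhonov problem is given by

$\min_{y, u} \frac12 \|y - g^\delta\|_{L^2(\omega_o)}^2 + \frac\alpha2 \|u\|_{L^2(\omega_c)}^2$ s.t. $-\Delta y = \chi_{\omega_c} u$, $y|_{\partial\Omega} = 0$.

Hence, using

$\rho_w = \chi_{\omega_o}(y_h - g^\delta) - \Delta w_h$,
$\rho_u = \alpha u_h - w_h|_{\omega_c}$,
$\rho_y = -\Delta y_h - \chi_{\omega_c} u_h$,

estimates (3.12) and (3.13) become

(3.14) $\alpha \|u_h - u\|_{L^2(\omega_c)}^2 + \|y_h - y\|_{L^2(\omega_o)}^2 \le \frac2\alpha \|(-\Delta)^{-1}[\rho_w] + \rho_u\|_{L^2(\omega_c)}^2 + 4 \|(-\Delta)^{-1}[\rho_y]\|_{L^2(\omega_o)}^2$,

(3.15) $J_\alpha(u_h, y_h) - J_\alpha(u, y) \le \frac{1}{2\alpha} \|(-\Delta)^{-1}[\rho_w] + \rho_u\|_{L^2(\omega_c)}^2 + (y_h - g^\delta, (-\Delta)^{-1}[\rho_y])_{L^2(\omega_o)}$.

It remains to describe how the right-hand sides can be evaluated for a given discrete
approximation $(u_h, y_h)$. The residual $\rho_w$ can be estimated using a conventional error estimator:
Observing that $w_h$ and $\tilde w$ solve the discretized and continuous Poisson equation with the same
right-hand side $-C^*(C y_h - g^\delta)$, we can write

(3.16) $(A^*)^{-1} \rho_w = w_h - \tilde w = \bigl( (A^*)^{-1} - (A_h^*)^{-1} \bigr) C^*(C y_h - g^\delta)$,

where $A_h^*$ denotes the discretized adjoint operator. Hence, using duality-based error estimators, e.g., from [1, Sec. 2.4], with $\varphi = A^{-1} B B^*(w_h - \tilde w) \in \mathcal Y$,
we obtain

$\|B^*(A^*)^{-1} \rho_w\|_{\mathcal U}^2 = \|B^*(w_h - \tilde w)\|_{\mathcal U}^2 = \langle A\varphi, w_h - \tilde w \rangle_{\mathcal W^*, \mathcal W}$
$= \langle \varphi, A^*(w_h - \tilde w) \rangle_{\mathcal Y, \mathcal Y^*} = \langle \varphi - I_h \varphi, A^*(w_h - \tilde w) \rangle_{\mathcal Y, \mathcal Y^*}$,

where we have used Galerkin orthogonality in the last equality. Since $\Omega$ is assumed to be convex
and polyhedral, we can apply (2.13) to $\varphi \in H^2(\Omega)$ to obtain for all $K \in \mathcal T_h$ the estimates

$\|\varphi - I_h \varphi\|_{L^2(K)} \le c_I h_K^2 |\varphi|_{H^2(K)}$ and $\|\varphi - I_h \varphi\|_{L^2(\partial K \setminus \partial\Omega)} \le c_I h_K^{3/2} |\varphi|_{H^2(K)}$.

Due to $H^2$-regularity, we can also apply (2.14) to further estimate $|\varphi|_{H^2(\Omega)} \le c_S \|w_h - \tilde w\|_{L^2(\omega_c)}$.
From (3.7) and integration by parts, we thus obtain

$\|w_h - \tilde w\|_{L^2(\omega_c)}^2 = \sum_{K \in \mathcal T_h} \Bigl( \int_K (\varphi - I_h \varphi) \rho_w \, dx + \int_{\partial K} (\varphi - I_h \varphi) \nabla w_h \cdot \nu \, ds \Bigr)$
$\le c_I \sum_{K \in \mathcal T_h} \Bigl( h_K^2 \|\rho_w\|_{L^2(K)} + \frac12 h_K^{3/2} \|[\nabla w_h \cdot \nu]\|_{L^2(\partial K \setminus \partial\Omega)} \Bigr) |\varphi|_{H^2(K)}$
$\le c_T \Bigl( \sum_{K \in \mathcal T_h} h_K^4 \|\rho_w\|_{L^2(K)}^2 + \frac12 h_K^3 \|[\nabla w_h \cdot \nu]\|_{L^2(\partial K \setminus \partial\Omega)}^2 \Bigr)^{1/2} \|w_h - \tilde w\|_{L^2(\omega_c)}$,

where $c_T := c_I c_S$, and $[\cdot]$ denotes the jump over the element boundary $\partial K$ with normal $\nu$.
Canceling the norm on both sides then yields

(3.17) $\|(-\Delta)^{-1}[\rho_w]\|_{L^2(\omega_c)} = \|w_h - \tilde w\|_{L^2(\omega_c)}$
$\le c_T \Bigl( \sum_{K \in \mathcal T_h} h_K^4 \|\rho_w\|_{L^2(K)}^2 + \frac12 h_K^3 \|[\nabla w_h \cdot \nu]\|_{L^2(\partial K \setminus \partial\Omega)}^2 \Bigr)^{1/2} =: c_T \eta_w$.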
Note that although $\rho_w$ is globally only an element of $H^{-1}(\Omega)$, we may take its elementwise $L^2(K)$
norm, since $w_h$ is piecewise polynomial and therefore $\Delta(w_h|_K) \in L^2(K)$. In case of piecewise
linear finite elements, we just have $\|\chi_{\omega_o}(y_h - g^\delta)\|_{L^2(K)}$ in place of $\|\rho_w\|_{L^2(K)}$.

The term containing $\rho_u$ is straightforward to evaluate as a sum of elementwise contributions.
Analogously to (3.16), we have a similar representation for $\rho_y$ in (3.8). As in (3.17), we can thus
estimate

(3.18) $\|y_h - \tilde y\|_{L^2(\omega_o)} \le c_T \Bigl( \sum_{K \in \mathcal T_h} h_K^4 \|\rho_y\|_{L^2(K)}^2 + \frac12 h_K^3 \|[\nabla y_h \cdot \nu]\|_{L^2(\partial K \setminus \partial\Omega)}^2 \Bigr)^{1/2} =: c_T \eta_y$.

Combining (3.14) and (3.15) with (3.17) and (3.18), we thus obtain the explicit a posteriori estimates

$\alpha \|u_h - u\|_{L^2(\omega_c)}^2 + \|y_h - y\|_{L^2(\omega_o)}^2 \le \frac2\alpha \|(-\Delta)^{-1}[\rho_w] + \rho_u\|_{L^2(\omega_c)}^2 + 4 \|y_h - \tilde y\|_{L^2(\omega_o)}^2$
$\le \frac2\alpha \bigl( c_T \eta_w + \|\rho_u\|_{L^2(\omega_c)} \bigr)^2 + 4 [c_T \eta_y]^2$,

$J_\alpha(u_h, y_h) - J_\alpha(u, y) \le \frac{1}{2\alpha} \bigl( c_T \eta_w + \|\rho_u\|_{L^2(\omega_c)} \bigr)^2 + c_T \eta_y \|y_h - g^\delta\|_{L^2(\omega_o)}$.

Remark 3.2. The inner product term in (3.15) could in principle lead to a negative estimate of
$J_\alpha(u_h, y_h) - J_\alpha(u, y)$, which by (2.7) would mean that no refinement is required from the point of
view of cost functional accuracy. However, so far we have not found a means to reasonably evaluate
this term as a possibly negative inner product (approximating $(-\Delta)^{-1}$ by its discretized version
would just make the term vanish) and thus estimate it by the Cauchy-Schwarz inequality.

Estimates (3.17) and (3.18) give bounds on quantities defined on the possibly restricted subdomains
$\omega_c$ and $\omega_o$, respectively. However, the estimators are sums of contributions on the whole domain $\Omega$,
and the dependence on the subdomains $\omega_c$, $\omega_o$ only enters indirectly via the definition of $\rho_w$, $w_h$,
$\rho_y$, and $y_h$. Still, this makes sense, since these estimators are supposed to indicate local refinement
of the finite element mesh for $w_h$ and $y_h$ defined on all of $\Omega$.

Remark 3.3. Related results can be found in the literature on a posteriori error estimates for optimal
control problems. We mention [33, 35], where $H^1$-error estimates are used in contrast to the $L^2$-
estimators employed above. Goal-oriented error estimators of dual-weighted-residual type are
investigated in, e.g., [5, 23, 30, 31].
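The structure of the indicators $\eta_w$ and $\eta_y$ can be illustrated in one space dimension. The following is a minimal sketch, assuming the problem $-y'' = f$ on $(0, 1)$ with homogeneous Dirichlet data, piecewise linear elements, a midpoint rule for the load vector, and the weighting $h_K^4 \|\rho\|_{L^2(K)}^2 + \frac12 h_K^3 |\text{jump}|^2$; it is a one-dimensional analogue with hypothetical function names, not the implementation used for the numerical experiments.

```python
import math

def solve_p1_poisson(nodes, f):
    """P1 finite element solution of -y'' = f with y = 0 at both end points."""
    n = len(nodes) - 1
    h = [nodes[i + 1] - nodes[i] for i in range(n)]
    m = n - 1  # number of interior nodes
    diag = [1.0 / h[i] + 1.0 / h[i + 1] for i in range(m)]
    off = [-1.0 / h[i + 1] for i in range(m - 1)]
    mid = [0.5 * (nodes[i] + nodes[i + 1]) for i in range(n)]
    rhs = [0.5 * h[i] * f(mid[i]) + 0.5 * h[i + 1] * f(mid[i + 1]) for i in range(m)]
    # Thomas algorithm for the symmetric tridiagonal system.
    for i in range(1, m):
        w = off[i - 1] / diag[i - 1]
        diag[i] -= w * off[i - 1]
        rhs[i] -= w * rhs[i - 1]
    y = [0.0] * m
    y[-1] = rhs[-1] / diag[-1]
    for i in range(m - 2, -1, -1):
        y[i] = (rhs[i] - off[i] * y[i + 1]) / diag[i]
    return [0.0] + y + [0.0]

def l2_indicators(nodes, y, f):
    """Squared elementwise indicators h^4*|f|^2_{L2(K)} + 0.5*h^3*(sum of squared slope jumps)."""
    n = len(nodes) - 1
    h = [nodes[i + 1] - nodes[i] for i in range(n)]
    slope = [(y[i + 1] - y[i]) / h[i] for i in range(n)]
    eta2 = []
    for i in range(n):
        a, b = nodes[i], nodes[i + 1]
        # The elementwise residual equals f because y_h is linear on K; Simpson rule for its norm.
        res2 = h[i] / 6.0 * (f(a) ** 2 + 4.0 * f(0.5 * (a + b)) ** 2 + f(b) ** 2)
        jump2 = 0.0
        if i > 0:
            jump2 += (slope[i] - slope[i - 1]) ** 2
        if i < n - 1:
            jump2 += (slope[i + 1] - slope[i]) ** 2
        eta2.append(h[i] ** 4 * res2 + 0.5 * h[i] ** 3 * jump2)
    return eta2

def estimator(n, f=lambda x: 1.0):
    """Global estimator on a uniform mesh with n elements."""
    nodes = [i / n for i in range(n + 1)]
    y = solve_p1_poisson(nodes, f)
    return math.sqrt(sum(l2_indicators(nodes, y, f)))
```

For $f = 1$ on a uniform mesh the estimator scales like $h^2$, matching the expected order of the $L^2$ error for linear elements; in an adaptive loop, the entries of `l2_indicators` would be used to mark elements for refinement.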




4 BANACH SPACE NORM CONSTRAINT 

In this section, we consider as regularization term

$\mathcal R_\alpha = \delta_{B_{1/\alpha}^{\mathcal U}}$, and hence $\mathcal R_\alpha^* = \frac1\alpha \|\cdot\|_{\mathcal X}$.

This setting is of particular interest for incorporating pointwise almost everywhere bounds on
$u$ via $\mathcal U = L^\infty(\omega_c)$; see Section 4.2 below. Let us recall that in the setting $\mathcal U = \mathcal X^*$, the operator
$B$ is explicitly assumed to be an adjoint operator, which is the case in the example considered in
Section 4.2.

Using the definitions of Section 2.1 and standard arguments from convex analysis, we obtain
for (2.2) the optimality conditions

$C^*(Cy - g^\delta) + A^* w = 0$,
$u \in B_{1/\alpha}^{\mathcal U}$ and $\langle \tilde u - u, B^* w \rangle_{\mathcal U, \mathcal X} \le 0$ for all $\tilde u \in B_{1/\alpha}^{\mathcal U}$,
$Ay - Bu = 0$.

The corresponding discrete optimality conditions are

(4.1) $R_{\mathcal Y_h}[C^*(C y_h - g^\delta) + A^* w_h] = 0$,
$u_h \in B_{1/\alpha}^{\mathcal U_h}$ and $\langle \tilde u_h - u_h, B^* w_h \rangle_{\mathcal U, \mathcal X} \le 0$ for all $\tilde u_h \in B_{1/\alpha}^{\mathcal U_h}$,
$R_{\mathcal W_h}(A y_h - B u_h) = 0$.

4.1 ERROR ESTIMATES 

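Setting $\varphi_\alpha(u_1, u_2) = 0$, we obtain from Proposition 2.3 that the solution $u$ to (1.3) satisfies

$\|K\tilde u - Ku\|_{\mathcal G}^2 \le 2 \bigl( J_\alpha(\tilde u, K\tilde u) - J_\alpha(u, Ku) \bigr)$
$\le \frac2\alpha \|K^* g^*\|_{\mathcal X} + \|K\tilde u - g^\delta\|_{\mathcal G}^2 + \|g^* - g^\delta\|_{\mathcal G}^2 - \|g^\delta\|_{\mathcal G}^2$
$= \frac2\alpha \|K^*(\tilde g - g^\delta)\|_{\mathcal X} + 2 \langle \tilde u, K^*(\tilde g - g^\delta) \rangle_{\mathcal U, \mathcal X} + \|K\tilde u - \tilde g\|_{\mathcal G}^2$

for any $\tilde u \in \mathcal U$ with $\|\tilde u\|_{\mathcal U} \le 1/\alpha$ and $g^* := g^\delta - \tilde g \in \mathcal G$ for any $\tilde g \in \mathcal G$. Inserting $\tilde u = u_h$ and $\tilde g = C y_h$ with
$(u_h, y_h, w_h) \in \mathcal U_h \times \mathcal Y_h \times \mathcal W_h$ satisfying (4.1), we obtain

(4.2) $\|C\tilde y - Cy\|_{\mathcal G}^2 \le 2 \bigl( J_\alpha(u_h, \tilde y) - J_\alpha(u, y) \bigr)$
$\le \frac2\alpha \|B^* \tilde w\|_{\mathcal X} - 2 \langle u_h, B^* \tilde w \rangle_{\mathcal U, \mathcal X} + \|C A^{-1}(A y_h - B u_h)\|_{\mathcal G}^2$

with $\tilde y$ and $\tilde w$ defined as in (3.4) and (3.7), respectively. Note that by $\|u_h\|_{\mathcal U} \le 1/\alpha$, the term
$\frac1\alpha \|B^* \tilde w\|_{\mathcal X} - \langle u_h, B^* \tilde w \rangle_{\mathcal U, \mathcal X}$ is indeed nonnegative.

For the first and last relation in (4.1), we can define the residuals $\rho_w$ and $\rho_y$ as in (3.6) and,
taking into account (3.8)-(3.11), obtain a first a posteriori estimate.

Proposition 4.1. Let $\mathcal R_\alpha = \delta_{B_{1/\alpha}^{\mathcal U}}$. Then the minimizers $(u, y)$ of (2.2) and $(u_h, y_h)$ of (2.3) satisfy
the estimates

$\|C y_h - C y\|_{\mathcal G}^2 \le \frac4\alpha \|B^* \tilde w\|_{\mathcal X} - 4 \langle u_h, B^* \tilde w \rangle_{\mathcal U, \mathcal X} + 4 \|C A^{-1} \rho_y\|_{\mathcal G}^2$,

$J_\alpha(u_h, y_h) - J_\alpha(u, y) \le \frac1\alpha \|B^* \tilde w\|_{\mathcal X} - \langle u_h, B^* \tilde w \rangle_{\mathcal U, \mathcal X} + \|C A^{-1} \rho_y\|_{\mathcal G} \|C y_h - g^\delta\|_{\mathcal G}$,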
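with $\rho_y$ as in (3.6).

If a duality mapping $j^{\mathcal X}(x) \in \partial \|\cdot\|_{\mathcal X}(x)$ exists (e.g., if $\mathcal U = \mathcal X^*$), we can also define a residual
for the second relation in (4.1) by

(4.3) $\rho_u := \alpha u_h - j^{\mathcal X}(B^* w_h)$.

From $\langle j^{\mathcal X}(B^* w_h), B^* w_h \rangle_{\mathcal U, \mathcal X} = \|B^* w_h\|_{\mathcal X}$ and $\|u_h\|_{\mathcal U} \le 1/\alpha$ it follows that $\langle \rho_u, B^* w_h \rangle_{\mathcal U, \mathcal X} \le 0$. Then we can
estimate

$\|B^* \tilde w\|_{\mathcal X} - \alpha \langle u_h, B^* \tilde w \rangle_{\mathcal U, \mathcal X} = \langle -\rho_u - (j^{\mathcal X}(B^* w_h) - j^{\mathcal X}(B^* \tilde w)), B^* \tilde w \rangle_{\mathcal U, \mathcal X}$.

By construction we have that

$\langle j^{\mathcal X}(B^* w_h) - j^{\mathcal X}(B^* \tilde w), B^* w_h \rangle_{\mathcal U, \mathcal X} = \|B^* w_h\|_{\mathcal X} - \langle j^{\mathcal X}(B^* \tilde w), B^* w_h \rangle_{\mathcal U, \mathcal X} \ge 0$.

Hence it follows that

$\langle j^{\mathcal X}(B^* \tilde w) - j^{\mathcal X}(B^* w_h), B^* \tilde w \rangle_{\mathcal U, \mathcal X} \le \langle j^{\mathcal X}(B^* \tilde w) - j^{\mathcal X}(B^* w_h), B^*(\tilde w - w_h) \rangle_{\mathcal U, \mathcal X}$.

Introducing the symmetric Bregman distance of $\|\cdot\|_{\mathcal X}$ defined as

$D_{\|\cdot\|_{\mathcal X}}^{\mathrm{sym}}(B^* \tilde w, B^* w_h) := \langle j^{\mathcal X}(B^* \tilde w) - j^{\mathcal X}(B^* w_h), B^*(\tilde w - w_h) \rangle_{\mathcal U, \mathcal X}$,

we obtain the estimate

$\|B^* \tilde w\|_{\mathcal X} - \alpha \langle u_h, B^* \tilde w \rangle_{\mathcal U, \mathcal X} \le \langle -\rho_u, B^* \tilde w \rangle_{\mathcal U, \mathcal X} + D_{\|\cdot\|_{\mathcal X}}^{\mathrm{sym}}(B^* \tilde w, B^* w_h)$.

Using (3.8) in (4.2) together with the definitions of $\rho_u$ and $D_{\|\cdot\|_{\mathcal X}}^{\mathrm{sym}}$ yields the following estimates.

Proposition 4.2. Let $\mathcal U = \mathcal X^*$ and $\mathcal R_\alpha = \delta_{B_{1/\alpha}^{\mathcal U}}$. Then the minimizers $(u, y)$ of (2.2) and $(u_h, y_h)$ of
(2.3) satisfy the estimates

(4.4) $\|C y_h - C y\|_{\mathcal G}^2 \le \frac4\alpha \langle -\rho_u, B^* \tilde w \rangle_{\mathcal U, \mathcal X} + \frac4\alpha D_{\|\cdot\|_{\mathcal X}}^{\mathrm{sym}}(B^* \tilde w, B^* w_h) + 4 \|C A^{-1} \rho_y\|_{\mathcal G}^2$,

(4.5) $J_\alpha(u_h, y_h) - J_\alpha(u, y) \le \frac1\alpha \langle -\rho_u, B^* \tilde w \rangle_{\mathcal U, \mathcal X} + \frac1\alpha D_{\|\cdot\|_{\mathcal X}}^{\mathrm{sym}}(B^* \tilde w, B^* w_h) + \|C A^{-1} \rho_y\|_{\mathcal G} \|C y_h - g^\delta\|_{\mathcal G}$,

with $\rho_y$ as in (3.6) and $\rho_u$ as in (4.3).

Let us remark that due to (3.6), the unknown $\tilde w$ can be replaced by $w_h - (A^*)^{-1} \rho_w$. Hence, the
components of the error estimate are fully available in numerical implementations, as we will
show in more detail in Section 4.2.

If a variational discretization, i.e., $\mathcal U_h = \mathcal U$, is used, then from (4.1) we obtain $B^* w_h \in
\partial \delta_{B_{1/\alpha}^{\mathcal U}}(u_h)$, which is equivalent to $\alpha u_h \in \partial \|\cdot\|_{\mathcal X}(B^* w_h)$. This implies that $\rho_u = 0$, and hence (4.4)
and (4.5) reduce to

(4.6) $\|C y_h - C y\|_{\mathcal G}^2 \le \frac4\alpha D_{\|\cdot\|_{\mathcal X}}^{\mathrm{sym}}(B^* \tilde w, B^* w_h) + 4 \|C A^{-1} \rho_y\|_{\mathcal G}^2$,

(4.7) $J_\alpha(u_h, y_h) - J_\alpha(u, y) \le \frac1\alpha D_{\|\cdot\|_{\mathcal X}}^{\mathrm{sym}}(B^* \tilde w, B^* w_h) + \|C A^{-1} \rho_y\|_{\mathcal G} \|C y_h - g^\delta\|_{\mathcal G}$.

4.2 APPLICATION TO INVERSE SOURCE PROBLEM

We now apply the estimate from Proposition 4.2 to the model problem (2.12) for the case of
Ivanov regularization. In this case, we have $\mathcal U = L^\infty(\omega_c)$ and $\mathcal X = L^1(\omega_c)$, i.e., $\mathcal U = \mathcal X^*$, and
hence the duality mapping is given by

$j^{\mathcal X}(B^* w) = \operatorname{sign}(w|_{\omega_c})$.

As before, we take $\mathcal Y = H_0^1(\Omega) = \mathcal W$. The Ivanov problem is then given by

$\min_{y, u} \frac12 \|y - g^\delta\|_{L^2(\omega_o)}^2$ s.t. $|u(x)| \le \frac1\alpha$ for a.e. $x \in \omega_c$
and $-\Delta y = \chi_{\omega_c} u$, $y|_{\partial\Omega} = 0$.

The residuals used in Proposition 4.2 are now given by

$\rho_w := \chi_{\omega_o}(y_h - g^\delta) - \Delta w_h$,
$\rho_u := \alpha u_h - \operatorname{sign}(w_h|_{\omega_c})$,
$\rho_y := -\Delta y_h - \chi_{\omega_c} u_h$.

We will consider the case of variational discretization for simplicity, where we can make use
of the estimate (4.6). Since the term containing $\rho_y$ in (4.6) can be estimated by (3.18), it only
remains to consider the term containing $D_{\|\cdot\|_{\mathcal X}}^{\mathrm{sym}}$, which in this setting can be estimated by

$\frac1\alpha D_{\|\cdot\|_{\mathcal X}}^{\mathrm{sym}}(B^* \tilde w, B^* w_h) = \frac1\alpha (\operatorname{sign}(\tilde w) - \operatorname{sign}(w_h), \tilde w - w_h)_{L^2(\omega_c)} \le \frac2\alpha \|\tilde w - w_h\|_{L^1(\omega_c)}$.

(Note that we cannot expect smallness of $\|\operatorname{sign}(\tilde w) - \operatorname{sign}(w_h)\|_{L^\infty(\omega_c)}$ directly, since continuity
of the sign operator cannot be quantified on $\mathcal X = L^1(\omega_c)$.)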
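The bound of the symmetric Bregman distance by twice the $L^1$ distance can be illustrated in a discrete setting. The following is a small sketch, assuming vectors in $\mathbb R^n$ with unit quadrature weights in place of functions on $\omega_c$; the function names and sample values are hypothetical.

```python
def sign(t):
    """Sign function with sign(0) = 0."""
    return (t > 0) - (t < 0)

def sym_bregman_l1(w, wh):
    """Discrete analogue of (sign(w) - sign(wh), w - wh) for the l1 norm."""
    return sum((sign(a) - sign(b)) * (a - b) for a, b in zip(w, wh))

def l1_distance(w, wh):
    return sum(abs(a - b) for a, b in zip(w, wh))

w = [0.3, -0.2, 0.0, 1.0]
wh = [0.1, 0.1, -0.5, 1.2]
d_sym = sym_bregman_l1(w, wh)   # only entries with differing signs contribute
bound = 2.0 * l1_distance(w, wh)
```

Only the components in which the two arguments differ in sign contribute, which is why the distance can be small even though the sign functions themselves differ by up to 2 pointwise.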

In order to estimate the L^((Uc)-norm of w - w/j, we introduce 

z := sign(w - Wft)] € W^’^(Q) n ff^(Q) Vp < 00. 

We assume from here on that Q c IR^ is polygonal with interior angles of at most In this 

case, we obtain from [18, Thm. 1] that 

( 4 - 8 ) l|z|lw2.P(o) < csp ||sign(w - Wh)\\L-’(Q) <csp 

holds for allp > 2 with a constant cs > 0 independent ofp. In case that Q does not allow for 
such a regularity result, and (4.8) only holds for p = 2, we can use the L^-error estimate of 
Section 3.2. 

Let I^z be the piecewise linear interpolant of z. Then we have from [15, Thm 3.1.6] together 
with (4.8) for allp > d the estimate 

(4.9) ||z-+ \\z - z\\ic.(aK) < Clh^^‘^'^\\z\\w2,P(K) ^ CiCsph^^'^^^ 
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with a constant c/ > 0 depending only on the chosen finite element family. Using the definition 
of z, we obtain 

11* - = (Vz, V(w - Wft))i 2 (f 2 ) 

= (V(z - V(w - Wft))i2(Q) 

= -(z - - (V(z - I'^z), VWft)x, 2 (Q), 

where we have used Galerkin orthogonality and the fact that the interpolation operator : 
Cfo(Q) —> d/ft can indeed be applied to z € W'^’^(Q) D Hq{Q) Cft(Q). Here and below, (•, ■)i2 
denotes the inner product. Now we integrate by parts on each element to obtain 

II* - Wft||i,i(^^) ^ ^ Xa>o(yh - g^))L\K) 

KeTh 

- / Wwh ■ v(z - I'^z) ds) 

JdK ' 

- Yj (ll^“-^^^lli“W ll“^*ft + A'«o(y/i-/)llu(if) 

KeTh 

+ l|z - J^z||i~( 5 ir\ 5 Q)|||[Vwft • 

o d , 

< C/Cs 'Yj Pk (||-A*/I + XoJoiyh - 5^)llu(J<‘) + III^*/! • ^^]lllu(5M-9f^)) 

KeTh 

where we have used (4.9) with pK > d individually for each element K e 7 h. (As in (3.17), the 
term Awft vanishes in case of piecewise linear finite elements.) Choosing now px ~ d \ log(hj;f)| 
yields 

II*-Wft||i,i(„j < cics Y llog^^l^^ (IIPwIIli(jc) + IllVwft • =: crr]^,. 

KeTh 

With the help of this residual-based error estimate and of (3.18), the error estimates (4.6) and 
(4.7) can be computed. 
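The effect of the choice $p_K \sim d\,|\log(h_K)|$ can be checked numerically. The following minimal sketch (plain Python; the function name `weighted_factor` is our own and not part of the paper) evaluates the factor $p\,h^{2-d/p}$ from (4.9) for $p = d\,|\log h|$ and compares it with $d\,e\,|\log h|\,h^2$, to which it reduces because $h^{-d/p} = e$ for $h < 1$.

```python
import math

def weighted_factor(h, d=2):
    """Return p * h**(2 - d/p) for the choice p = d*|log h| (requires 0 < h < 1)."""
    p = d * abs(math.log(h))
    return p * h ** (2.0 - d / p)

for h in (0.1, 0.01, 0.001):
    # Closed form d * e * |log h| * h^2 with d = 2
    closed_form = 2 * math.e * abs(math.log(h)) * h**2
    print(h, weighted_factor(h), closed_form)
```

The constant $d\,e$ is absorbed into the generic constant of the final estimate; note that $p_K > d$ requires $|\log h_K| > 1$, i.e., $h_K < 1/e$.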


5 BANACH SPACE NORM REGULARIZATION 

In this section, we consider as regularization term 

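$\mathcal{R}_\alpha = \alpha\|\cdot\|_{\mathcal{U}}$, and hence $\mathcal{R}_\alpha^*$ is the indicator function of the ball $B_\alpha$ of radius $\alpha$ in $X$.

This setting is of particular interest for promoting sparsity of $u$ via $\mathcal{U} = \mathcal{M}(\Omega)$; see Section 5.2.
Again, in case $\mathcal{U} = X^*$ we explicitly assume that $B$ is an adjoint operator.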

As above, we obtain for (2.2) the optimality conditions 

(5.1)  $\begin{cases} C^*(C\bar y - g^\delta) + A^*\bar w = 0, \\ B^*\bar w \in B_\alpha \ \text{and}\ \langle \bar u, u^* - B^*\bar w \rangle_{\mathcal{U},X} \le 0 \quad \forall u^* \in B_\alpha, \\ A\bar y - B\bar u = 0. \end{cases}$

We again consider a discretization of this system. In the following, let $(y_h, u_h, w_h)$ be a discrete
approximation of the solution of (5.1) given by


(5.2)  $\begin{cases} R_{\mathcal{Y}_h}\bigl(C^*(Cy_h - g^\delta) + A^*w_h\bigr) = 0, \\ R_{\mathcal{W}_h}(Ay_h - Bu_h) = 0, \end{cases}$

together with a discretization of the second relation of (5.1), which however is intimately linked
to the choice of the space $\mathcal{U}_h$ and the discrete approximation of $B^*w \in B_\alpha$. We refer to Section 5.2
concerning details for the specific choice $\mathcal{U} = \mathcal{M}(\Omega)$.


5.1 ERROR ESTIMATES 

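Setting again $\varphi_\alpha(u_1, u_2) = 0$, we obtain from Proposition 2.3 that the solution $\bar u$ to (1.3) satisfies

$$\begin{aligned}
\|Ku - K\bar u\|_{\mathcal{G}}^2 &\le 2\bigl(J_\alpha(u, Ku) - J_\alpha(\bar u, K\bar u)\bigr) \\
&\le 2\alpha\|u\|_{\mathcal{U}} + \|Ku - g^\delta\|_{\mathcal{G}}^2 + \|g^\delta - g^*\|_{\mathcal{G}}^2 - \|g^\delta\|_{\mathcal{G}}^2 \\
&= 2\alpha\|u\|_{\mathcal{U}} + 2\langle u, K^*(g - g^\delta)\rangle_{\mathcal{U},X} + \|Ku - g\|_{\mathcal{G}}^2
\end{aligned}$$

for any $u \in \mathcal{U}$ and $g^* := g^\delta - g \in \mathcal{G}$ for any $g \in \mathcal{G}$ such that $K^*g^* \in B_\alpha$.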

Similarly as before, we set $u = u_h$. However, the choice $g = Cy_h$ is not possible, as $K^*(Cy_h - g^\delta) \notin B_\alpha$
in general. Hence, we introduce a scaling factor $\kappa > 0$ such that $g - g^\delta = \kappa(Cy_h - g^\delta)$
satisfies

$$-\kappa K^*(Cy_h - g^\delta) = -\kappa B^*(A^*)^{-1}C^*(Cy_h - g^\delta) = \kappa B^*w \in B_\alpha$$
with $w$ as in (3.7). It thus suffices to choose


(5.3)  $\kappa = \min\left\{ \dfrac{\alpha}{\|B^*w\|_X},\ 1 \right\}.$


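The estimation of $\kappa$ will be discussed below; see (5.6) and (5.8). Inserting $u = u_h$ and $g = \kappa Cy_h + (1 - \kappa)g^\delta$
with $(u_h, y_h, w_h) \in \mathcal{U}_h \times \mathcal{Y}_h \times \mathcal{W}_h$ satisfying (5.2), we obtain

$$\begin{aligned}
\|Cy_h - C\bar y\|_{\mathcal{G}}^2 &\le 2\bigl(J_\alpha(u_h, y_h) - J_\alpha(\bar u, \bar y)\bigr) \\
&\le 2\alpha\|u_h\|_{\mathcal{U}} - 2\langle u_h, \kappa B^*w\rangle_{\mathcal{U},X} + \|CA^{-1}(Ay_h - Bu_h) + (\kappa - 1)(Cy_h - g^\delta)\|_{\mathcal{G}}^2 \\
&= 2\bigl(\alpha\|u_h\|_{\mathcal{U}} - \langle u_h, B^*w_h\rangle_{\mathcal{U},X} + \kappa\langle u_h, B^*(w_h - w)\rangle_{\mathcal{U},X} + (1 - \kappa)\langle u_h, B^*w_h\rangle_{\mathcal{U},X}\bigr) \\
&\quad + \|CA^{-1}\rho_y + (\kappa - 1)(Cy_h - g^\delta)\|_{\mathcal{G}}^2 .
\end{aligned}$$

Note that by dual feasibility of $\kappa w$, the term $\alpha\|u_h\|_{\mathcal{U}} - \langle u_h, \kappa B^*w\rangle_{\mathcal{U},X}$ is nonnegative. Estimating
again the terms on the right-hand side using (3.8) and (3.9) with $\sigma = 4$ and $\gamma = 2$, we obtain the
following a posteriori estimate.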
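Proposition 5.1. Let $\mathcal{R}_\alpha = \alpha\|\cdot\|_{\mathcal{U}}$. Then the minimizers $(\bar u, \bar y)$ of (2.2) and $(u_h, y_h)$ of (2.3) satisfy
the estimates

(5.4)  $\|Cy_h - C\bar y\|_{\mathcal{G}}^2 \le 4\bigl(\alpha\|u_h\|_{\mathcal{U}} - \langle u_h, B^*w_h\rangle_{\mathcal{U},X}\bigr) + 4\kappa\langle u_h, B^*(w_h - w)\rangle_{\mathcal{U},X} + 4(1 - \kappa)\langle u_h, B^*w_h\rangle_{\mathcal{U},X} + 4\|CA^{-1}\rho_y + (\kappa - 1)(Cy_h - g^\delta)\|_{\mathcal{G}}^2,$

(5.5)  $J_\alpha(u_h, y_h) - J_\alpha(\bar u, \bar y) \le \bigl(\alpha\|u_h\|_{\mathcal{U}} - \langle u_h, B^*w_h\rangle_{\mathcal{U},X}\bigr) + \kappa\langle u_h, B^*(w_h - w)\rangle_{\mathcal{U},X} + (1 - \kappa)\langle u_h, B^*w_h\rangle_{\mathcal{U},X} + \|CA^{-1}\rho_y + (\kappa - 1)(Cy_h - g^\delta)\|_{\mathcal{G}}\, \|Cy_h - g^\delta\|_{\mathcal{G}},$

with $\rho_y$ as in (3.6) and $\kappa$ satisfying (5.3).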
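Once the individual quantities are available (or bounded), the right-hand sides of (5.4) and (5.5) are simple scalar combinations. The following sketch shows one possible way to assemble them; the function name `prop51_bounds` and all argument names are hypothetical and chosen only for illustration, and the inputs are assumed to be precomputed numbers.

```python
def prop51_bounds(alpha, norm_uh, pair_uh_wh, pair_uh_err, kappa, res_norm, misfit_norm):
    """Assemble the right-hand sides of (5.4) and (5.5) from scalar inputs.

    alpha        regularization parameter
    norm_uh      ||u_h||_U
    pair_uh_wh   <u_h, B^* w_h>
    pair_uh_err  (a bound for) <u_h, B^*(w_h - w)>
    kappa        scaling factor from (5.3), 0 < kappa <= 1
    res_norm     (a bound for) ||C A^{-1} rho_y + (kappa - 1)(C y_h - g^delta)||_G
    misfit_norm  ||C y_h - g^delta||_G
    """
    # Terms shared by both estimates
    common = (alpha * norm_uh - pair_uh_wh) \
        + kappa * pair_uh_err \
        + (1.0 - kappa) * pair_uh_wh
    bound_state = 4.0 * common + 4.0 * res_norm**2       # right-hand side of (5.4)
    bound_functional = common + res_norm * misfit_norm   # right-hand side of (5.5)
    return bound_state, bound_functional

print(prop51_bounds(1.0, 2.0, 2.0, 0.1, 1.0, 0.5, 3.0))
```

Replacing exact quantities by upper bounds (as done for the terms involving $w_h - w$ and $\rho_y$) keeps both inequalities valid, since each such term enters with a nonnegative coefficient.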

If a duality mapping ff'^{u) € ^|H|'Zy(u) exists (e.g., if <Y = TA*), we could again define a 
residual for the discrete version of the second relation in (5.1) via pu — B*Wh £^nd 

proceed similarly as in Section 5.1. Since this will not be the case in the example below, we do 
not do so here. 

The quantity $1 - \kappa$ can be estimated by

(5.6)  $1 - \kappa \le \max\left\{ 1 - \dfrac{\alpha}{\|B^*w\|_X},\ 0 \right\} \le \max\left\{ \dfrac{\|B^*w_h\|_X - \alpha + \|B^*(w_h - w)\|_X}{\|B^*w_h\|_X + \|B^*(w_h - w)\|_X},\ 0 \right\}.$

This bound can be written in terms of the residual $\rho_w$ as

$$1 - \kappa \le \max\left\{ \frac{\|B^*w_h\|_X - \alpha + \|B^*(A^*)^{-1}\rho_w\|_X}{\|B^*w_h\|_X + \|B^*(A^*)^{-1}\rho_w\|_X},\ 0 \right\},$$

which implies that the quantity $1 - \kappa$ is a combination of the violation of the dual constraint
$\|B^*w_h\|_X \le \alpha$ and the residual $\rho_w$. Thus we can expect $1 - \kappa$ to be small for a sufficiently fine
discretization. We refer to [44] for a related error estimate for state-constrained optimal control
problems.
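The relation between the scaling factor (5.3) and its computable bound (5.6) can be illustrated with scalar inputs. In the sketch below, the function names `kappa` and `one_minus_kappa_bound` are our own; `err` stands for any available upper bound on $\|B^*(w_h - w)\|_X$ and is an assumption of the example, not a quantity computed here.

```python
def kappa(alpha, norm_Bw):
    """Scaling factor (5.3): min(alpha / ||B^* w||_X, 1)."""
    return 1.0 if norm_Bw <= alpha else alpha / norm_Bw

def one_minus_kappa_bound(alpha, norm_Bwh, err):
    """Computable bound (5.6), where err >= ||B^*(w_h - w)||_X."""
    total = norm_Bwh + err
    if total == 0.0:
        return 0.0  # w_h = w = 0, so the dual constraint is trivially satisfied
    return max((total - alpha) / total, 0.0)

# Example: ||B^* w|| = 1.2, ||B^* w_h|| = 1.1, error bound 0.2 >= |1.2 - 1.1|
print(1.0 - kappa(1.0, 1.2), one_minus_kappa_bound(1.0, 1.1, 0.2))
```

The bound relies on the monotonicity of $t \mapsto (t - \alpha)/t$ together with the triangle inequality $\|B^*w\|_X \le \|B^*w_h\|_X + \|B^*(w_h - w)\|_X$.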


5.2 APPLICATION TO INVERSE SOURCE PROBLEM 

We now apply the estimate from Proposition 5.1 to the model problem (2.12) for the case of sparsity
regularization. In this case, we have $\mathcal{U} = \mathcal{M}(\omega_c)$ and $X = C_0(\omega_c)$, i.e., $\mathcal{U} = X^*$. Due to the low
regularity of the source term, we here set d/ = (Q) and AV = Wg’‘^(Q), where q' = ^ 

with n < q < to guarantee (Q) c L^(Q). The operator B : Ai(cOc) —> is 

defined as 

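with $B^*w = w|_{\omega_c}$. The Tikhonov problem is then given by

(5.7)  $\min_{u \in \mathcal{M}(\omega_c)} \dfrac{1}{2}\|y - g^\delta\|_{L^2(\omega_o)}^2 + \alpha\|u\|_{\mathcal{M}(\omega_c)} \quad \text{s.t.} \quad -\Delta y = \chi_{\omega_c}u, \quad y|_{\partial\Omega} = 0.$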









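From [17], we have existence of a minimizer $\bar u \in \mathcal{M}(\omega_c)$ as well as an optimal state $\bar y \in W_0^{1,q'}(\Omega)$
and an adjoint state $\bar w \in W_0^{1,q}(\Omega)$ satisfying the optimality conditions

$$\begin{cases} -\Delta\bar w + \chi_{\omega_o}(\bar y - g^\delta) = 0, \quad \bar w|_{\partial\Omega} = 0, \\ \|\bar w\|_{C_0(\omega_c)} \le \alpha \ \text{and}\ \langle \bar u, w - \bar w\rangle_{\mathcal{M}(\omega_c), C_0(\omega_c)} \le 0 \quad \forall\, \|w\|_{C_0(\omega_c)} \le \alpha, \\ -\Delta\bar y - \chi_{\omega_c}\bar u = 0, \quad \bar y|_{\partial\Omega} = 0. \end{cases}$$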

As $\Omega$ is convex and polyhedral, we can employ $H^2$-regularity results. We take here as well
$\mathcal{Y}_h \subset \mathcal{Y}$ and $\mathcal{W}_h \subset \mathcal{W}$ as piecewise linear finite elements, and thus the residuals in the first
and third relation are once more given by

$$\rho_w = \chi_{\omega_o}(y_h - g^\delta) - \Delta w_h, \qquad \rho_y = -\Delta y_h - \chi_{\omega_c}u_h .$$

We again use a variational discretization, i.e., $\mathcal{U}_h = \mathcal{U}$. It was shown in [13] that the corresponding
semi-discretization of (5.7) admits a unique minimizer of the form $u_h = \sum_{j=1}^{N_h} u_j\delta_{x_j}$, where $\delta_x$
denotes the Dirac measure concentrated on $x \in \Omega$ and $\{x_j\}_{j=1}^{N_h}$ are the interior vertices of $\mathcal{T}_h$
lying in $\omega_c$. Hence, we have that


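$$\alpha\|u_h\|_{\mathcal{M}(\omega_c)} = \langle u_h, w_h\rangle_{\mathcal{M}(\omega_c), C_0(\omega_c)},$$

so that the first term on the right-hand sides of (5.4) and (5.5) vanishes. Furthermore, from [13]
we have that

$$\langle u_h, w_h\rangle_{\mathcal{M}(\omega_c), C_0(\omega_c)} = \sum_{j=1}^{N_h} u_j w_j$$

for any $w_h = \sum_{j=1}^{N_h} w_j e_j$, where $e_j$ is the piecewise linear finite element basis function
corresponding to the vertex $x_j$.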
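Because $u_h$ is a sum of Dirac measures at vertices and $w_h$ is determined by its nodal values, both the measure norm and the duality pairing reduce to finite sums over coefficient vectors. The sketch below illustrates this with made-up coefficients (the data and the function names `pairing` and `measure_norm` are hypothetical); the nodal values are chosen so that $|w_j| \le \alpha$ everywhere and $w_j = \alpha\,\operatorname{sign}(u_j)$ on the support of $u_h$, in which case the identity $\alpha\|u_h\|_{\mathcal{M}(\omega_c)} = \langle u_h, w_h\rangle$ holds.

```python
def pairing(u_coeffs, w_nodal):
    """<u_h, w_h> = sum_j u_j * w_j for u_h = sum_j u_j delta_{x_j}, w_j = w_h(x_j)."""
    return sum(u_j * w_j for u_j, w_j in zip(u_coeffs, w_nodal))

def measure_norm(u_coeffs):
    """Norm of a sum of Dirac measures at distinct points: sum_j |u_j|."""
    return sum(abs(u_j) for u_j in u_coeffs)

alpha = 0.1
u = [0.0, 2.0, -0.5, 0.0]            # Dirac coefficients (illustrative values)
w = [0.03, alpha, -alpha, -0.07]     # nodal values with |w_j| <= alpha
print(alpha * measure_norm(u), pairing(u, w))
```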

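To estimate the term $A^{-1}\rho_y$, we use the residual error estimator for Dirac measure data from
[2] (note that here $\rho_y|_K \notin L^2(K)$). There exists a constant $c_2 > 0$ independent of $h$ such that

$$\|y_h - y\|_{L^2(\omega_o)} \le c_2 \Bigl( \sum_{K \in \mathcal{T}_h} h_K^3\, \|[\![\nabla y_h \cdot \nu]\!]\|_{L^2(\partial K \setminus \partial\Omega)}^2 \Bigr)^{1/2} =: c_2 \eta_y .$$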
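The term $\langle u_h, w_h - w\rangle_{\mathcal{M}(\omega_c), C_0(\omega_c)}$ on the right-hand side of (5.4) can be estimated as

$$\begin{aligned}
\langle u_h, w_h - w\rangle_{\mathcal{M}(\omega_c), C_0(\omega_c)} &= \sum_{j=1}^{N_h} u_j\bigl(w_h(x_j) - w(x_j)\bigr) = \sum_{j=1}^{N_h} u_j\bigl(w_h(x_j) - (\mathcal{I}_h w)(x_j)\bigr) \\
&= \int_\Omega \nabla y_h \cdot \nabla(w_h - \mathcal{I}_h w)\, dx = \int_\Omega \nabla y_h \cdot \nabla(w - \mathcal{I}_h w)\, dx \\
&= \sum_{K \in \mathcal{T}_h} \int_{\partial K} \nabla y_h \cdot \nu\, (w - \mathcal{I}_h w)\, ds \\
&\le c_2 c_I \Bigl( \sum_{K \in \mathcal{T}_h} h_K^3\, \|[\![\nabla y_h \cdot \nu]\!]\|_{L^2(\partial K \setminus \partial\Omega)}^2 \Bigr)^{1/2} |w|_{H^2(\Omega)} \\
&\le c_2 c_I c_s \Bigl( \sum_{K \in \mathcal{T}_h} h_K^3\, \|[\![\nabla y_h \cdot \nu]\!]\|_{L^2(\partial K \setminus \partial\Omega)}^2 \Bigr)^{1/2} \|y_h - g^\delta\|_{L^2(\omega_o)},
\end{aligned}$$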

where we have used the definition of $y_h$ in the third equality, the definition of $w_h$ and $w$ in the
fourth equality, elementwise integration by parts and elementwise linearity of $y_h$ in the fifth
equality, as well as (2.13) and (2.14).

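In order to estimate $1 - \kappa$, we apply the $L^\infty(\Omega)$ residual error estimator of [40]; see also [44],
which is valid even for nonconvex polyhedral domains. It was proven in [40] that there exists a
constant $c > 0$ depending on $\Omega$ and the shape regularity of the triangulation such that

$$\|w_h - w\|_{L^\infty(\omega_c)} \le c\, |\log h_{\min}|^2 \max_{K \in \mathcal{T}_h} \Bigl( h_K^2\, \|{-\Delta w_h} + \chi_{\omega_o}(y_h - g^\delta)\|_{L^\infty(K)} + h_K\, \|[\![\nabla w_h \cdot \nu]\!]\|_{L^\infty(\partial K \setminus \partial\Omega)} \Bigr) =: c\, \eta_w^\infty,$$

where $h_{\min} := \min_{K \in \mathcal{T}_h} h_K$. Inserting this into (5.6), we obtain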


(5.8)  $1 - \kappa \le \max\left\{ \dfrac{\|B^*w_h\|_X - \alpha + c\,\eta_w^\infty}{\|B^*w_h\|_X + c\,\eta_w^\infty},\ 0 \right\} =: \eta_\kappa .$
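A maximum-type indicator of the kind entering (5.8) is straightforward to evaluate from elementwise data. The sketch below assumes that the element sizes, the elementwise $L^\infty$ norms of the interior residual, and the elementwise $L^\infty$ norms of the normal-derivative jumps are already available as lists; the function name `eta_w_inf`, the argument names, and the default exponent of the logarithmic factor are assumptions made for illustration and should be matched to the estimator actually used.

```python
import math

def eta_w_inf(h, res_inf, jump_inf, log_power=2):
    """|log h_min|^log_power * max_K ( h_K^2 * res_inf[K] + h_K * jump_inf[K] ).

    h         element diameters h_K (all in (0, 1))
    res_inf   elementwise L^inf norms of the interior residual
    jump_inf  elementwise L^inf norms of the normal-derivative jump
    """
    h_min = min(h)
    local = [hk**2 * r + hk * j for hk, r, j in zip(h, res_inf, jump_inf)]
    return abs(math.log(h_min)) ** log_power * max(local)

print(eta_w_inf([0.5, 0.25], [1.0, 2.0], [0.1, 0.4]))
```

The local values in `local` can also serve as elementwise refinement indicators.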

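Collecting all the results, we obtain from Proposition 5.1 the a posteriori estimates

(5.9)  $\|y_h - \bar y\|_{L^2(\omega_o)}^2 \le 4c_3\eta_w + 4\eta_\kappa \langle u_h, w_h\rangle_{\mathcal{M}(\omega_c), C_0(\omega_c)} + 4\bigl(c_2\eta_y + \eta_\kappa\|y_h - g^\delta\|_{L^2(\omega_o)}\bigr)^2,$

(5.10)  $J_\alpha(u_h, y_h) - J_\alpha(\bar u, \bar y) \le c_3\eta_w + \eta_\kappa \langle u_h, w_h\rangle_{\mathcal{M}(\omega_c), C_0(\omega_c)} + 4\bigl(c_2\eta_y + \eta_\kappa\|y_h - g^\delta\|_{L^2(\omega_o)}\bigr)\|y_h - g^\delta\|_{L^2(\omega_o)}.$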

Remark 5.2. A posteriori estimators for a state-constrained control problem can also be found 
in [43]. This control problem is related to the dual problem to (5.7), which takes the form of a 
state-constrained problem without a discrepancy term. (Conversely, the dual to the problem in
[43] involves a Huber norm in place of the measure-space norm in (5.7).) Furthermore, in [43] the 
state constraint is penalized, which manifests in an additional penalty in the dual problem. The 
resulting error estimator then gives combined bounds on the regularization and the discretization 
error. 


6 NUMERICAL EXAMPLE 


We illustrate our error estimators with numerical results for the example from Section 5.2. In
order to have available an exact analytical solution, we use the example from [41, Section 8.1]:
Setting $\Omega = \omega_c = \omega_o = B_1(0) \subset \mathbb{R}^2$, we have that $-\Delta y^\dagger = u^\dagger$ for

$$y^\dagger(x) = -\frac{1}{2\pi}\ln\bigl(\max\{\rho, |x|_2\}\bigr), \qquad u^\dagger = \frac{1}{2\pi\rho}\,\mathcal{H}^1|_{\partial B_\rho(0)},$$

where $\rho \in (0, 1)$ is arbitrary and $\mathcal{H}^1$ denotes the one-dimensional Hausdorff measure. Furthermore,
$\bar u = u^\dagger$ is the minimizer of (5.7) for given $\alpha > 0$ if the data is chosen as


$$g^\delta(x) = -\frac{1}{2\pi}\ln\bigl(\max\{\rho, |x|_2\}\bigr) + \alpha\,\varphi(|x|_2)$$


with 


( 6{3r-2p) 

‘?’(^) “ i 6{3r^-2rp-2r+p) 
[ ip-lpr 


for r < p 
for r > p. 


In the following, we set p = 0.5 and a = 10 “^ unless specified otherwise. The corresponding 
discrete approximations Uh are computed using the approach from [13]. 

We first illustrate Proposition 5.1 by comparing in Figure 1 the errors in residual and functional
value to the terms in (5.9) and (5.10) for a sequence of meshes obtained by uniform
refinement (Figure 1a) as well as by adaptive refinement using the procedure described in [44]
(Figure 1b). We also show the rate $\mathcal{O}(h^2)$, which up to a logarithmic factor is known to hold
for the residual and Tikhonov functional error; see [41, Thm. 6.2]. This rate also seems to be
satisfied for our estimator.

To illustrate Proposition 2.1, we consider $g := y^\dagger$ as exact data, add Gaussian noise at different
levels $\delta$, and adaptively compute the corresponding minimizer. Specifically, we start
from a relatively large ao = 10 “^ and coarse uniform mesh. In an outer loop, we then reduce the 
regularization parameter for 0 = 0.6 until the discrepancy principle (2.5) with T = 2 is 

satisfied. In an inner loop, we adaptively refine the discretization according to the error estimator 
from Proposition 5.1 until the precision requirements (2.6) and (2.7) from Proposition 2.1 are 
satisfied. The resulting residuals, regularization parameters and functional values for different
noise levels are plotted in Figure 2 and show a convergence rate of $\mathcal{O}(\delta)$.
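The nested structure of this procedure (outer loop over the regularization parameter, inner loop over mesh refinement) can be sketched generically. The code below is not the implementation used for the experiments: `solve` and `refine` are hypothetical user-supplied callables, the inner stopping test is abstracted into a boolean returned by `solve` (standing in for the precision requirements (2.6) and (2.7)), and the toy stand-ins at the end only mimic the qualitative behavior of a residual.

```python
import math

def adaptive_tikhonov(solve, refine, mesh, alpha0, delta,
                      theta=0.6, tau=2.0, max_outer=100, max_inner=30):
    """Inner loop: refine until accurate. Outer loop: shrink alpha until residual <= tau*delta.

    solve(alpha, mesh) -> (residual_norm, accurate); refine(mesh) -> finer mesh.
    """
    alpha = alpha0
    for _ in range(max_outer):
        for _ in range(max_inner):
            residual, accurate = solve(alpha, mesh)
            if accurate:
                break
            mesh = refine(mesh)
        if residual <= tau * delta:      # discrepancy-type stopping test
            return alpha, mesh, residual
        alpha *= theta                   # reduce the regularization parameter
    raise RuntimeError("discrepancy principle not reached")

# Toy stand-ins (not a PDE solver): the "mesh" is a refinement level L, the
# residual is sqrt(delta^2 + alpha) + 2^-L, and "accurate" means 2^-L <= delta/10.
DELTA = 1e-2

def toy_solve(alpha, level):
    disc_err = 2.0 ** (-level)
    return math.sqrt(DELTA**2 + alpha) + disc_err, disc_err <= 0.1 * DELTA

def toy_refine(level):
    return level + 1

alpha, level, residual = adaptive_tikhonov(toy_solve, toy_refine, 0, 1.0, DELTA)
print(alpha, level, residual)
```

Keeping the mesh from the previous outer iteration, as done here, avoids recomputing refinements when only the regularization parameter changes.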


7 CONCLUSION 

Reliable estimators for the discretization error in Tikhonov regularization can be computed
using the approach from [42]. Combining this with a general result on convergence of discrete
approximations and an appropriate adaptive mesh refinement strategy yields convergence of
these approximations to a solution of the inverse problem. The approach can in particular be
applied to the Banach-space setting required for sparsity enhancement or Ivanov regularization.

Figure 1: Comparison of true error and estimator ((a) uniform refinement, (b) adaptive refinement; plotted quantities include the functional error and the error in the state)

Figure 2: Illustration of convergence rates as $\delta \to 0$

These error estimators can be incorporated into a local refinement strategy for mesh adaptation.
As shown in the examples, the estimators can be written in terms of sums over the element
domains (or their interfaces) of a triangulation. Thus it makes sense to subdivide elements with 
relatively large contribution to the error estimator. Note that using variational discretizations 
according to [13, 25], we do not refine independently for parameter, state, and adjoint, but use a 
common mesh for all three quantities. 
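A simple way to turn elementwise contributions into a refinement decision is a maximum strategy: mark every element whose contribution is at least a fixed fraction of the largest one. The sketch below is only one possible choice and is not necessarily the procedure from [44] used in the experiments; the function name `mark_elements` and the threshold parameter `theta` are assumptions for illustration.

```python
def mark_elements(indicators, theta=0.5):
    """Return indices of elements whose indicator is at least theta * (largest indicator)."""
    if not indicators:
        return []
    threshold = theta * max(indicators)
    return [k for k, eta_k in enumerate(indicators) if eta_k >= threshold]

# Four elements with illustrative local estimator contributions
print(mark_elements([0.1, 0.4, 0.25, 0.05], theta=0.5))
```

Since a common mesh is used for parameter, state, and adjoint, the indicator of an element would combine its contributions to all terms of the estimator before marking.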

Future research will be devoted to transferring this approach to nonlinear inverse problems 
via iterative linearization similarly to [29] as well as to all-at-once approaches based on the 
model-and-measurement formulation (1.1-1.2). 


APPENDIX 

In this appendix we prove that the implication (3.9) holds for $\sigma$ and $\gamma$ chosen according to (3.10).
Since $a + b^2 \le c + d^2$ is equivalent to $a + (b + d)^2 \le 2bd + c + 2d^2$ for $b, d \ge 0$, the implication
(3.9) is equivalent to

(A.1)  $\forall c, d \ge 0\ \ \forall\, 0 \le b \le \sqrt{c + d^2}: \quad 2bd \le (\gamma - 1)c + (\sigma - 2)d^2,$

which (with $b = 0$) can only be satisfied if $\gamma \ge 1$ and $\sigma \ge 2$. It suffices to consider in (A.1) strictly
positive $d$, so that upon division by $d^2$ and setting $x = b/d$ and $y = c/d^2$, the conclusion of (3.9) is
equivalent to

$$\sigma \ge 2 \ \wedge\ \gamma \ge 1 \ \wedge\ \forall y \ge 0\ \ \forall x \in [0, \sqrt{1 + y}]: \quad 2x \le (\gamma - 1)y + \sigma - 2 .$$


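This obviously holds iff

$$\sigma \ge 2 \ \wedge\ \gamma \ge 1 \ \wedge\ \forall y \ge 0: \quad 2\sqrt{1 + y} \le (\gamma - 1)y + \sigma - 2 .$$

Setting $z = \sqrt{1 + y} - 1$, this is equivalent to

$$\sigma \ge 2 \ \wedge\ \gamma \ge 1 \ \wedge\ \forall z \ge 0: \quad 2z \le (\gamma - 1)\bigl((z + 1)^2 - 1\bigr) + \sigma - 4 .$$

For $z = 0$, this implies $\sigma \ge 4$, and hence (3.9) is equivalent to

$$\sigma \ge 4 \ \wedge\ \gamma \ge 1 \ \wedge\ \forall z \ge 0: \quad 0 \le (\gamma - 1)z^2 - 2(2 - \gamma)z + \sigma - 4 .$$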

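We first consider the right-hand side as a quadratic polynomial in $z$, whose roots are given by
$z_\pm = (\gamma - 1)^{-1}\bigl(2 - \gamma \pm \sqrt{D}\bigr)$ for $D := (2 - \gamma)^2 - (\gamma - 1)(\sigma - 4) = \gamma^2 - \gamma\sigma + \sigma$. We thus arrive at
the equivalent condition

$$\sigma \ge 4 \ \wedge\ \gamma \ge 1 \ \wedge\ \bigl( D \le 0 \ \vee\ [\, D > 0 \ \wedge\ \gamma \ge 2 \ \wedge\ (\gamma - 2)^2 \ge D \,] \bigr) .$$

Considering now $D$ as a quadratic polynomial in $\gamma$ with roots

$$\gamma_\pm = \frac{\sigma \pm \sqrt{\sigma^2 - 4\sigma}}{2},$$

we arrive at

$$\sigma \ge 4 \ \wedge\ \gamma \ge 1 \ \wedge\ \bigl( \gamma_- \le \gamma \le \gamma_+ \ \vee\ [\, (\gamma > \gamma_+ \vee \gamma < \gamma_-) \ \wedge\ \gamma \ge 2 \ \wedge\ (\gamma - 2)^2 \ge D \,] \bigr) .$$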

For $\sigma \ge 4$ and $\gamma \ge 1$, it can be easily checked that $(\gamma - 2)^2 \ge D$ and $1 < \gamma_- \le 2$ hold (the latter
with strict inequality if $\sigma > 4$). We have thus shown that (3.9) holds iff the conditions (3.10) are
satisfied.
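As a sanity check of the argument, consider the values $\sigma = 4$ and $\gamma = 2$ used in Section 5.1. The following worked computation (our own illustration, not part of the original proof) verifies (A.1) directly for this choice and also confirms the reduced condition in the variable $z$.

```latex
% Worked check of (A.1) for \sigma = 4, \gamma = 2 (illustrative only).
% Claim: 2bd \le c + 2d^2 for all c, d \ge 0 and 0 \le b \le \sqrt{c + d^2}.
\begin{align*}
  2bd &\le 2d\sqrt{c + d^2}, \\
  (c + 2d^2)^2 - \bigl(2d\sqrt{c + d^2}\bigr)^2
      &= c^2 + 4cd^2 + 4d^4 - 4cd^2 - 4d^4 = c^2 \ge 0 .
\end{align*}
% Hence 2bd \le 2d\sqrt{c + d^2} \le c + 2d^2 = (\gamma - 1)c + (\sigma - 2)d^2.
% In the variable z: D = \gamma^2 - \gamma\sigma + \sigma = 4 - 8 + 4 = 0, and
% (\gamma - 1)z^2 - 2(2 - \gamma)z + \sigma - 4 = z^2 \ge 0 for all z \ge 0.
```

Since $D = 0$ here, the choice $\sigma = 4$, $\gamma = 2$ is the borderline case $\gamma = \gamma_- = \gamma_+ = 2$.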